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f PRELIMINARY AMENDMENT 

Box PATENT APPLICATION 
* pirector of Patents and Trademarks 
JJIVashington, DC 20231 

5ir: 

Prior to examination on the merits of this patent application, please amend the application 
as follows: 



IN THE SPECIFICATION 



Please amend the specification as follows: 

Page 8, line 3, after "3"' insert - (SEQ ID No. 1) -; 
line 14, change "25" to - 24 --; 
line 16, change "26" to - 25 --; and 
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line 17, change "27" to - 26 -. 
Page 20, line 33, change " reped" to -- replaced --. 
Page 21, line 35, after "3"' insert - (SEQ ID No. 2) -; 

line 37, after "3"' insert -- (SEQ ID No. 3) --; 

line 39, after "3"' insert - (SEQ ID No. 4) -. 
Page 22, line 9, after "3"' first occurrence, insert - (SEQ ID No. 5) -, 

and after "3"' second occurrence, insert - (SEQ ID No. 6) -. 
Page 27, line 18, after "nucleotide", insert - (SEQ ID No. 7) --; 

line 30, after "oligos:", insert - (SEQ ID No. 8) -. 
Page 29, line 32, after "sequences" insert -- (SEQ ID No. 9) --. 
Page 30, line 24, after "176" insert -- (SEQ ID No. 10) -. 
Page 31, line 20, after "3'" insert - (SEQ ID No. 1) -; 

line 22, after "gene III" insert - (SEQ ID Nos. 1 1 and 12) -; 

line 40, after "3'" insert - (SEQ ID No. 13) --; 

line 41, after "3'" insert - (SEQ ID No. 14) --; 

line 43, after "3'" insert - (SEQ ID No. 15) --. 
Page 32, line 16, after "3'" insert - (SEQ ID No. 16) -; 

line 18, after "3'" insert -- (SEQ ID No. 17) --. 
Page 38, line 1, after "3'" insert - (SEQ ID No. 18) -; 

line 2, after "3'" insert - (SEQ ID No. 19) -; 

line 7, after "3'" insert - (SEQ ID No. 20) -; 

line 8, after "3'" insert - (SEQ ID No. 21) -. 
Page 42, line 3, after "3'" insert -- (SEQ ID No. 22) -; 

line 4, after "3'" insert - (SEQ ID No. 23) -. 
Page 55, line 4, after "3'" insert - (SEQ ID No. 27) -. 
Page 56, line 48, after "3'" insert - (SEQ ID No. 28) -; 

line 50, after "3'" insert - (SEQ ID No. 29) --. 
Page 57, line 19, after "coding strand" insert -- ; SEQ ID No. 30-; 

line 20, after "strand" insert - ; SEQ ID No. 31--. 
Page 60, line 11, after "Sequence" insert - (SEQ ID Nos. 32-37) ~. 
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Page 61, line 11, after "Sequence" insert -- (SEQ ID Nos. 38-43) 

Please amend the specification by replacing the original Sequence Listing pages 
62-72 with the attached revised Substitute Sequence, pages 62-76. Please further amend 
the specification by renumbering pages 73-81 to follow consecutively after the Substitute 
Sequence Listing. 

IN THE CLAIMS 

Please cancel pending claim 1 without prejudice and insert therefor the following new 

claims: 

-53. A method for producing a product polypeptide, comprising the steps of: 

(1) culturing a host cell transformed with a replicable expression vector, the replicable 
expression vector comprising DNA encoding a product polypeptide operably linked to a control 
sequence capable of effecting expression of the product polypeptide in the host cell; wherein the 
DNA encoding the product polypeptide has been obtained by a method comprising the steps of: 

(a) constructing a family of variant replicable plasmids comprising a transcription 
regulatory element operably linked to a gene fusion encoding a fusion protein, wherein the gene 
fusion comprises a first gene encoding a polypeptide and a second gene encoding at least a 
portion of a phage coat protein, wherein the variant replicable plasmids comprise variant first 
genes encoding polypeptides; 

(b) transforming suitable host cells with the plasmids; 

(c) infecting the transformed host cells with an amount of helper phage encoding 
the phage coat protein sufficient to produce recombinant phagemid particles wherein no more 
than about 20 percent of the phagemid particles display one or more copies of the fusion protein 
on the surface of the phagemid particles; 

(d) culturing the transformed infected host cells under conditions suitable for 
forming recombinant phagemid particles containing at least a portion of the plasmid and capable 
of transfornning the host cells; 

(e) contacting the recombinant phagemid particles with a target molecule so that 
at least a portion of the phagemid particles bind to the target molecule; 
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(f) separating phagemid particles that bind to the target molecule from those that 

do not bind; 

(g) selecting one of the variant polypeptides displayed on a phagemid particle 
from step (f) as the product polypeptide; cloning DNA encoding the product polypeptide into the 
replicable expression vector; and 

(2) recovering expressed product polypeptide. 

54. The method of claim 53, farther comprising infecting suitable host cells with the 
phagemid particles that bind or do not bind and repeating steps (d) through (f). 

55. The method of claim 54, wherein the steps are repeated one or more times. 

56. The method of claim 53, wherein the expression vector farther comprises a secretory 
signal sequence. 

57. The method of claim 53, wherein the transcription regulatory element is a promoter 
system selected from the group consisting of lac Z, pho A, tryptophan, tac, lambdaPL, 
bacteriophage T7, and combinations thereof 

58. The method of claim 53, wherein the product polypeptide is a mammalian protein. 

59. The method of claim 58, wherein the product polypeptide is an antibody or binding 
fragment thereof 

60. The method of claim 59, wherein the product polypeptide comprises an antibody Fab 
portion, 

61 . The method of claim 58, wherein the product polypeptide is a human protein. 

62. The method of claim 53, wherein the product polypeptide comprises more than about 100 
amino acid residues. 
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63. The method of claim 53, wherein the phage coat protein is Ml 3 phage gene III coat 
protein or Ml 3 phage gene VIII coat protein. 

64. The method of claim 63, wherein the phage coat protein is M13 phage gene III coat 
protein. 

65. The method of claim 53, wherein the conditions are adjusted so that no more than 10% of 
phagemid particles display one or more copies of the fusion protein on the surface of the particle. 

66. The method of claim 53, wherein the amount of phagemid particles displaying more than 
one copy of the fusion protein on the surface of the particles is less than 20% of the amount of 
phagemid particles displaying a single copy of the fusion protein. 

67. The method of claim 53 wherein the number of phagemid particles displaying more than 
one copy of the fusion protein on the surface of the particle is less than about 10% of the amount 
of phagemid particles displaying a single copy of the fusion protein. 

68. The method of claim 53, wherein the number of phagemid particles displaying more than 
one copy of the fusion protein is less than 1% of the phagemid particles displaying a single copy 
of the fusion protein. 

69. The method of claim 53 , wherein the conditions are adjusted so that the number of fusion 
proteins per recombinant phagemid particle is about 0.1 (number of bulk fusion proteins/number 
of phagemid particles). 

70. The method of claim 53, wherein the gene fusion comprises a DNA triplet encoding an 
mRNA suppressible terminator codon between the first gene encoding a polypeptide and the 
second gene encoding at least a portion of a phage coat protein. 

71 . The method of claim 53, wherein the helper phage is selected from the group consisting 
of M13K07, M13R408, M13-VCS, and Phi XI 74. 
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72. The method of claim 53, wherein the helper phage is M13K07 and the coat protein is the 
Ml 3 phage gene III coat protein. 



73. The method of claim 53, wherein the host cells are E. coli. 

74. The method of claim 53, wherein the variant replicable plasmids do not contain a fiirther 
gene encoding a mature phage coat protein. 

75. The method of claim 53, wherein the variant replicable plasmids do not contain a 
complete phage genome. 

76. A product polypeptide produced by the method of Claim 53 . 

77. A method for producing a product polypeptide, comprising the steps of: 

(1) culturing a host cell transformed with a replicable expression vector, the 
replicable expression vector comprising DNA encoding a product polypeptide operably linked to 
a control sequence capable of effecting expression of the product polypeptide in the host cell; 
wherein the DNA encoding the product polypeptide has been obtained by a 
method comprising the steps of: 

(a) expressing in recombinant host cells variant gene fusions, each gene fusion 
comprising a first gene encoding a polypeptide and a second gene encoding at least a portion of a 
phage coat protein, the variant gene fusions encoding a family of variant fiision proteins; 

(b) culturing the recombinant host cells under conditions suitable for forming 
recombinant phagemid particles, wherein no more than a minor amount of the phagemid 
particles display more than one copy of the fusion protein on the surface of the phagemid 
particles; 

(c) contacting the recombinant phagemid particles with a target molecule so that 
at least a portion of the phagemid particles bind to the target molecule; 

(d) separating phagemid particles that bind to the target molecule from those that 

do not bind; 
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(e) selecting one of the variant polypeptides displayed on a phagemid particle 
from step (d) as the product polypeptide; cloning DNA encoding the product polypeptide into the 
repli cable expression vector; and 

(2) recovering expressed product polypeptide. 

78. The method of claim 77, wherein the product polypeptide is a mammalian protein. 

79. The method of claim 78^ wherein the product polypeptide is an antibody or binding 
fragment thereof. 

80. The method of claim 78, wherein the product polypeptide comprises an antibody Fab 
portion. 

8 1 . The method of claim 78, wherein the product polypeptide is a human protein. 

82. The method of claim 77, wherein the phage coat protein is Ml 3 phage gene III coat 
protein. 

83. The method of claim 77, wherein the conditions are adjusted so that no more than 10% of 
phagemid particles display one or more copies of the fusion protein on the surface of the particle. 

84. The method of claim 77, wherein the amount of phagemid particles displaying more than 
one copy of the fusion protein on the surface of the particles is less than 20% of the amount of 
phagemid particles displaying a single copy of the fusion protein, 

85. The method of claim 77, wherein the number of phagemid particles displaying more than 
one copy of the fusion protein on the surface of the particle is less than about 10% of the amount 
of phagemid particles displaying a single copy of the fusion protein. 
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86. The method of claim 77, wherein the number of phagemid particles displaying more than 
one copy of the fusion protein is less than 1% of the phagemid particles displaying a single copy 
of the fusion protein. 

87. The method of claim 77, wherein the conditions are adjusted so that the number of fusion 
proteins per recombinant phagemid particle is about 0.1 (number of bulk fusion proteins/number 
of phagemid particles). 

88. The method of claim 77, wherein the gene fusion comprises a DNA triplet encoding an 
mRNA suppressible terminator codon between the first gene encoding a polypeptide and 
the second gene encoding at least a portion of a phage coat protein. 

89. A gene fusion, comprising a first gene encoding a first polypeptide, a second gene 
encoding a second polypeptide, and a suppressible termination codon between or adjacent to the 
first and second genes. 

90. The gene fusion of claim 89, wherein the second polypeptide is at least a portion of a 
phage coat protein. 

91. The gene fusion of claim 90, wherein the suppressible termination codon is UAG, UAA 
or UGA. 

92. The gene fusion of claim 90, wherein the phage coat protein is a filamentous 
bacteriophage coat protein in or a portion thereof - 

REMARKS AND REQUEST FOR CONSIDERATION 
Upon entry of the Preliminary Amendment, claims 53-92 are pending in this case. 
Support for the new claims is found in the claims as originally filed and in the specification as 
filed. No new matter is believed to have been introduced. 
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The specification has been amended to correct a typographical error and to indicate the 

sequence identification numbers. 

Early entry of these amendments is requested. The inventors submit that this 
appUcation is now in compliance with the requirements of 37 CFR 1.821-1.825, and 
respectfiiUy request further processing of this application. 

New Claims 53-88 are supported by the specification as filed and are substantially 
directed to a method of producing a product polypeptide in which the DNA which encodes the 
product polypeptide is obtained by a phagemid display method of the invention. A method of 
identifying novel binding polypeptides is tiie subject of the grandparent application Serial No. 
08/463,587, now US Patent No. 5,821,047 and great grandparent application Serial No. 
08/050,058, now US Patent No. 5,750,373. See also US Patent No. 6,040,136 which claims 
phagemid vectors. The method of the present invention substantially includes the method steps 
of the grandparent application and fiirther includes additional steps of selecting the novel binding 
polypeptide, cloning DNA encoding the product polypeptide into an expression vector and 
producing the product polypeptide by culturing a host cell containing the expression vector. 
Since the grandparent claims were found to be allowable and the present claims substantially 
include the steps of the grandparent appUcation plus additional steps, the claims of the present 
apphcation are also beheved to be allowable. 

Support for the additional steps recited in tiie present claims is found in the specification 
as filed, for example, on page 1, lines 4-5; page 3, lines 38-39; page 9, lines 11-13 and the 
generally well-known methods of cloning, vector constriction, cell culture and polypeptide 
expression which are described on pages 1 1-20 and in the examples. 

New claims 89-92 are supported by page 15, line 29 over to page 16, line 7. 

The claims now pending in this application are believed to be in condition for allowance. 
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CONCLUSION 

In view of the amendments and remarks made above, the claims now pending in this 
application are believed to be in condition for allowance. Early notice to this effect is earnestly 
solicited. The Examiner is invited to telephone the undersigned at (650) 225-7467 if it is deemed 
helpful to clarify and advance prosecution. 

Respectfully submitted, 
GENENTECH, INC. 



Date: November 21, 2000 




Timotht R. Schwartz, Ph.D. 
Reg. No. 32,171 



1 DNAWay 

South San Francisco, CA 94080-4990 
Phone: (650) 225-7467 
Fax: (650) 952-9881 
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ENRICHMEKT METHOD FOR VARIAhTT PROTEiNS WITH ALTERED BINDING PROPERTIES 

RELDOFTHEINVEhfTiON 

5 This invention relates to the preparation and systematic selection of novel binding proteins having 

altered binding properties for a target molecule. Specifically, this invention relates to methods for prodixiing 
foreign polypeptides mimicking the binding activity of naturally occurring binding partners. In preferred 
embodiments, the invention is directed to the preparation of therapeutic or diagrostic compounds that mimic 
proteins or nonpeptidyi molecules such a hormones, onjgs and other smaH motecutes, particuiarty biologically active 
1 0 moieoies such as growth homione. 

BACKGROUND OF THE INVENTION 

Binding partners are substances that specificaiiy bind to one another, usually through noncovalenl 
interactions. Examples of binding partners include ligand-receptor, antibody-antigen, drug-target, and enzyme- 
substrate interactions. Binding partners are extremely useful in both therapeutic and diagnostic fields. 

1 5 Binding partr>ers have teen produced in tiie past by a variety of methods indudirkg; han/esting them 

from nature (e.g., antitxwjy-antigen, and ligand-receptor pairings) and by adventitious identification (e.g. 
traditional dajg development employing random screening of candidate molecules). In some instances these two 
approaches have been combined. For example, variants of proteins or polypeptides, such as polypeptide 
fragments, have been made that contain key functional residues that participate in binding. These polypeptide 

20 fragments, in turn, have been derivatized by methods akin to traditional drug development. An example of such 

derivitization v^uki include strategies such as cyclization to conformationatfy constrain a polypeptide fragment to 
produce a novel candidate bindir^ partner. 

The problem with prior art methods is that naturally occun-ing ligands may not have proper 
charactenstics for all therapeutic applications. Additwrially, polypeptide ligands may not even be available for 

25 some target substances. FurtiTermore, methods for making non-naturaliy occun-ing synthetic binding partners 
are often expensive and difficult, usually requiring complex synthetk; mettiods to produce each candidate. The 
inability to characterize the structure of the resulting candidate so that rational drug design methods can be 
applied for further optimization of candkjate molecules further hampers these methods. 

In an attempt to overcome these problems, Geysen (Geysen» immun. Today. 6:364-369 [1 985]); and 

30 (Geysen ef a/., Mol. Immun.. 23:709-715 [1986]) has proposed the use of polypeptide synthesis to provide a 

framewori^ for systematk; iterative binding partner kientification and preparation. According to Geysen et a/., 
Ibid, short polypeptides, such as dipeptkJes, are first screened for the ability to bind to a target molecule. The 
most active dipeptides are then selected for an additional round of testing compriang linking, to the starting 
dipeptide, an additional residue (or by internally modifying the components of the original starting dipeptide) and 

35 then screening this set of candidates for the desired activity. This process Is reiterated until the binding partner 
having the desired properties is identified. 

The Geysen et al metix>d suffers from the disadvantage ti^t tf)e chemistry upon which it is based, 
peptide synthesis, produces molecules with ill-defined or variable secondary and tertiary structure. As rounds of 
Iterative selection progress, random interactions accelerate among the various substituent groups of the 

40 polypeptide so that a true random populatbn of interactive molecules having reproducible higher order structure 
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becomes less and less attair^abie. For example, interactions between side chains of amino acids, which are 
sequentially widely separated but which are spatially neighbors, freely occur. Furthermore, sequences that do not 
fadlitate confonriationany stable secondary stnjctures provide complex peptkie-sidechain interactions which may 
prevent sidechain interactions of a given amino add with the target molecuie. Such complex interactions are 
5 fadlitated by the flexibility of the polyamide backbone of the polypeptide candidates. Additionally, candidates 
may exist in numerous conformations making it difficult to identify the confonner that interacts or binds to the 
target with greatest affinity or spedficity complicating rational drug design. 

A fml problem with the iterative polypeptide method of Geysen is that, at present, there are no 
practical methods with which a great diversity of different peptides can be produced, screened and analyzed. By 

1 0 using the twenty naturally occurring amino adds, the total number of all combinations of hexapeptides tf^t must 
be synthesized is 64,CX)0,000. Even having prepared such a diversity of peptides, there are no methods available 
with which mixtures of such a diversity of peptides can be rapidly saeened to select those peptides having a high 
affinity for the target molecule. At present, each "adherenr peptide must be recovered in amounts large enough 
to carry out protein sequendng. 

15 To overcome many of the problems inherent in the Geysen approach, biological selection and saeening 

was chosen as an alternative. Bologkal selections and screens are powerful tools to probe protein function and to 
isolate variant proteins with desirable properties (Shortie, Protein Engineering. Oxender and Fox, eds., A.R. Liss, 
Inc., NY, pp. 103-105 [1988]) and Bowie ef a/., SotSDJ^ 247:1306-1310 [1990)]. However, a given selection or 
screen is applicable to only one or a small number of related proteins. 

20 Recently, Smith and coworkers (Smith, Science. 228: 1315-1317 [1985]) and Parmley and Smith, Gene . 

73:305-318 [1985] have demonstrated that small protein fragments (10-50 amino adds) can be "displayed" 
effidently on the surface of filamentous phage by inserting short gene fragments into gene 11! of the fd phage 
("fusion phage"). The gene 111 minor coat protein (present in about 5 copies at one end of the virion) is important 
for proper phage assembly and for infection by attachment to the piii of E cofi (see Flasched et al , f^icrobiol. 

2 5 Rev.. 50: 401 -427 [1986]). Recently, "fusion phage" have been shown to be useful for displaying short mutated 

peptide sequences for identifying peptides that may react with antibodies (Scott el a/.. Science 249: 386-390, 
[1990] )and Cwirla et a/., Proc. Natl. Acad. U.S.A 87; 6378-6382, [1990]).or a foreign protein (Devlin et aL, 
Science . 249 : 404-406 [1990]). 

There are, however, several important limitations in using such "fusion phage" to identify altered 

3 0 peptides or proteins with new or enhanced binding properties. First, it has been shown (Parmley et ai. Gene . 73: 

305-318, [1988]) that fusion phage are useful only for displaying proteins of less than 100 and preferably less 
than 50 amino add rescues, because large inserts presumably disrupt the function of gene ill and therefore phage 
assembly and infectivity. Second, prior art mettiods have been unable lo select peptides from a librae having the 
highest binding affinity for a target molecule. For example, after exhaustive panning of a random peptide library 
35 with an anti-p endorphin monodonal antibody, Cwirla and co-workers could not separate moderate affinity 

peptides (Kd - 10 |iM) from higher affinity peptides (K<j -0.4 p.M) fused to phage. Moreover, the parent p- 
endorphin peptide sequence which has very high affinity (Kd - 7nM), was not panned from the epitope library. 

Ladner WO 90/02802 disdoses a method for selecting novel binding proteins displayed on the outer 
surface of cells and viral partides where it is contemplated that the heterotogous proteins may have up to 164 



amino acid residues . The method contemplates isolating and amplifying the displayed proteins to engineer a new 
family of binding proteins having desired affinity for a target molecule. More spedficaily. 1-adner discloses a 
"fusion phage" displaying proteins having "initial protein binding domains" ranging from 46 residues (crambin) to 
164 residues (14 lysozyme) fused to the M13 gene III coat protein. Udner teaches the use of proteins "no larger 
than necessary" because it is easier to arrange restriction sites in smaller amino acid sequences and prefers the 58 
amino acid residue bovine pancreatic trypsin inhibitor (BPTI). Small fusion proteins, such as BPTI, are preferred 
when the target is a protein or macromolecule, while larger fusion proteins, such as T4 lysozyme, are preferred for 
small target molecules such as steroids because such large proteins have clefts and grooves into which small 
molecules can fit. The preferred protein, BPTI, is proposed to be fused to gene III at the site disclosed by Smith 
e/ a/, or de la Cruz et al,, J. Biol. Chem.. 2S3: 431 84322 [1988], or to one of the termNi, along with a second 
synthetic copy of gene III so that "some" unaltered gene HI protein will be present. Ladner does not address the 
problem of successfully panning high affinity peptides from the random peptide library which plagues the 
biological selection and screening methods of the prior art 

Human growth hormone (hGH) partidpates in much of the regulation of normal human growth and 
development. This 22,000 dalton pituitary hormone exhibits a multitude of biological effects including linear 
growth (somatogenesis), lactation, activation of macrophages, insulin-like and diabetogenic effects among others 
(Chawla, R. K. (1983) Ann. Rgy. Mffd.M, 519; Edwards, C. K. et al. (1988) Science 25g^. 769; Thomer, M. 0., et al. 
(1988) J, Clin. Invefft.aL 745). Growth homione deficiency in children leads to dwarfism which has been 
successfully treated for more than a decade by exogenous administration of hGH. hGH is a member of a family of 
homologous hormones that include placental lactogens, prolactins, and other genetic and species variants or growth 
hormone (Nicoil, C. S., etaL, (1986) EntfoCTO RgvigWSZ. 169). hGH is unusual among these in that it exhibits broad 
species specificity and binds to either the cloned somatogenic (Leung, D. W., et ai, [1987] Nature 330. 537) or 
prolactin receptor (Boutin, J. M,ef al., [1988] ££; 51 69). The cloned gene for hGH has been expressed in a 
secreted fomi in Escherichia M (Chang, C. N., et al,, P987] fi£D£ 55. 189) and its DNA and amino acid sequence has 
been reported (Goeddel, et ai, 0979] Na t ure 231 544; Gray, et a/., [1985] Sana 32, 247). The three-dimensional 
stn)Cture of hGH is not available. However, the three-dimensional folding pattern for porcine growrth hormone 
(pGH) has been reported at moderate resolution and refinement (Abdel-Meguid, S. S., et ai, [1987] Proc. Natl. 
Acadr Sci. USft Si 6434). Human growth hormone's receptor and antibody epitopes have been identified by 
homolog-scanning mutagenesis (Cunningham etaU Science 2431 1330, 1989). The structure of novel amino terminal 
methionyl bovine growth hormone containing a spliced-in sequence of human growth homione including histidine 18 
and histidine 21 has been shown (U.S. Patent 4,880,910) 

Human growth hormone (hGH) causes a variety of physiotogical and metabolic effects in various animal 
models including linear bone growth, lactation, activation of macrophages, insulin-like and diabetogenic effects and 
others (R. K. Chawla etai, Annu. Rev. Med. 34, 519 (1983); 0. G. P. Isaksson etaL, Amu, Rev. Physbf. 47, 483 
(1985); C. K. Edwards etaL Science 239, 769 (1988); M. O. Thomer and M. L. Vance, J. Oin. invest, 82, 745 
(1 988): J. P. Hughes and H. G. Friesen, Ann. Rev. Physiol 47, 469 (1985)). These biological effects derive from 
the interaction between hGH and specific cellular receptors.. 

Accordingly, it is an otjject of this invention to provide a rapid and effective method for the systematic 
preparation of candidate binding substances. 
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It is arxjther object of this invention to prepare candidate binding substances displayed on surface of a 
phagemid particle that are conformalionaiiy stable. 

It is another object of this invention to prepare candidate binding substances comprising fusion proteins 
of a phage coat protein and a heterologous polypeptide where the polypeptide is greater than 100 amino acids in 
5 length and may be more than one subunit and is displayed on a phagemid partide where the polypeptide is encoded 
by the phagemid genome. 

It is a further object of this invention to provide a method for the preparation and selection of binding 
substances that is sufficiently versatile to present or display, all peptidyl moieties that could potentially 
participate in a noncovalent binding interaction, and to present these moieties in a fashion that is sterlcaliy 
1 0 confined. 

Still another object of the invention is the production of growth hormone variants that exhibit stronger 
affinity for growth hormone receptor and binding protein. 

it is yet another object of this invention to produce expression vector phagemlds that contain a 
suppressible termination codon functionally located between the heterologous polypeptide arxj the phage coat 

1 5 protein such that detectable fusion protein is produced in a host suppressor cell and only the heterologous 
polypeptide is produced in a non-suppressor host cell. 

Finally, it is an object of this invention to produce a phagemid particle that rarely displays more than one 
copy of candidate binding proteins on the outer surface of the phagemid particle so that efficient selection of high 
affinity binding proteins can be achieved. 

20 These and other objects of this invention will be apparent from consideration of the Inventbn as a whole, 

SUMMARY OF THE INVENfTlON 
These objectives have been achieved by providing a method for selecting novel binding polypeptides 
comprising: (a) constructing a repilcaWe expression vector comprising a first gene encoding a polypeptide, a 
second gene encoding at least a portion of a natural or wild-type phage coat protein wherein the first and second 

25 genes are heterologous, and a transcription regulatory element operably linked to the first and second genes, 

thereby forming a gene fusion encoding a fusion protein; (b) mutating the vector at one or more selected positions 
within the first gene thereby forming a family of related plasmids; (c) transforming suitable host cells with the 
plasmids; (d) infecting the transformed host cells with a helper phage having a gene encoding the phage coat 
protein; (e) culturing the transformed infected host cells under conditions suitable for forming recombinant 

3 0 phagemid particles containing at least a portion of the plasmid and capable of transforming the host, the 

conditions adjusted so that no more than a minor amount of phagemid particles display more than one copy of the 
fusion protein on the surface of the particle; (f) contacting the phagemid particles with a target molecule so that 
at least a portion of the phagemid particles bind to the target molecule; and (g) separating the phagemid 
particles that bind from those that do not Preferably, the method further comprises transforming suitable host 

35 cells with recombinant phagemid particles that bind to the target molecule and repeating steps (d) through (g) 
one or more times. 

Additionally, the method for selecting novel binding proteins where the proteins are composed of more 
than one subunit is achieved by selecting novel tending peptides comprising constructing a replicable expression 
vector comprising a transcription regulatory element operably linked to DNA encoding a protein of interest 



5 



containing one or more subunits, wherein the DNA erKX)ding at least one of the subunits is fused to the DNA 
encoding at least a portion of a phage coat protein ;mutating the DNA encoding the protein of Interest at one or 
more selected positions thereby forming a family of related vectors; transforming suitable host cells with the 
vectors; infecting the transformed host cells with a helper phage having a gene encoding the phage coat protein; 
5 cuituring the transformed infected host cells under conditions suitable for forming recombinant phagemid 

particles containing at least a portion of the plasmid and capable of translorming the host, the conditions adjusted 
so that no more than a minor amount of phagemid particles display more than one copy of the fusion protein on the 
surface of the particle; contacting the phagemid particles with a target motecule so that at least a portion of the 
phagemid particles bind to the target molecule; and separating the phagemid partides that bind from those that 
10 do not, 

Preferably In the method of this invention the plasmid Is under tight control of the transcription 
regulatory element, and the cuituring conditions are adjusted so that the amount or number of phagemid particles 
displaying more than one copy of the fusion protein on the surface of the particle is less than about 1%. Also 
preferably, amount of phagemid particles displaying more than one copy of the fusion protein is less than 10% the 

1 5 amount of phagemid particles displaying a single copy of the fusion protein. Most preferably the amount is less 

than 20%. 

Typically, in the method of this Invention, the expression vector will further contain a secretory signal 
sequences fused to the DNA encoding each subuntt of the polypeptide, and the transcription regulatory element 
will be a promoter system. Preferred promoter systems are selected from; Lac Z, XpL» TAC, T 7 polymerase, 

2 0 tr^tophan, and alkaline phosphatase promoters and combinations thereof. 

Also typically, the first gene will encode a mammalian protein, preferably the protein will be selected 
from; human grovi^ hormone{hGH), N-methionyl human growth hormone, bovine growth homione, parathyroid 
hormor>e, thyroxine, insulin A-chain, insulin B-ohain, proinsulin, relaxin A-chain, retaxin B-chain, prorelaxin, 
glycoprotein hormones such as follicle stimulating honmone{FSH), thyroid stimulating hormone(TSH), and 

25 leutinizing hormone(LH), glycoprotein hormone receptors, calatonin, glucagon, factor VIII, an antibody, lung 
surfactant, urokinase, streptokinase, human tissue-type plasminogen activator (t-PA), bombesin, factor iX, 
frirombin, hemopoietic growth factor, tumor necrosis factor-alpha and -beta, enkephatinase, human semm albumin, 
mullerian-inhibiting sifcstance, mouse gonadotropin-assodated peptide, a microbial protein, such as 
betalactamase, tissue factor protein, inhibin, activin, vascular endothelial growth factor, receptors for hormones 

30 or growth factors; Integrin, thrombopoietin, protein A or D, rheumatoid factors, nerve grovi^ factors such as 
NGF'p, platelet-growth factor, transforming growth factors (T6F) such as TGF-alpha and TGF-beta, insulin- 
like grovrth factor-l and -It, insulin-like growth factor binding proteins , CD-4, DNase, latency assodated peptide, 
erythropoietin, osteoinductive factors, interferons such as interferon-alpha, -beta, and -gamma, cotony 
stimulating factors (CSFs) such as M-CSF, GM-CSF, and G-CSF. interleukins (lis) such as lL-1, IL-2, IL-3, IL- 

35 4, superoxide dismutase; decay accelerating factor, viral antigen, HIV envelope proteins such as GP1 20, GP140, 
atrial natriuretic peptides A, B or C, immunoglobulins, and fragments of any of the above-listed proteins. 

Preferably the first gene will encode a polypeptide of one or more subunits containing more than about 
100 amino add residues and will be folded to form a plurality of rigid secondary stnjctures displaying a plurality 
of amino adds capable of interacting wlh the target. Preferably the first gene will be mutated at codons 
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corresponding to only the amino acids capable of Niteracting with the target so that the integrity of the rigid 
secondary structures will be preserved. 

Normally, the method of this invention will employ a helper phage selected from; M13KO7,M13R408, 
M13-VCS, and Phi X 174. The preferred helper phage is M13K07, and the prefen^d coat protein is the M13 
5 Phage gene III coat protein. The preferred host is E co//, and protease deficient strains of £ colL Novel hGH 
variants selected by the method of the present invention have been detected. Phagemid expression vectors were 
constmcted that contain a suppressible termination codon fi^rctionaily located between the nucleic acids encoding 
the polypeptide and the phage coat protein. 

BRIEF DESCRiPnON OF THE FIGURES 

1 0 RGURE 1, Strategy for displaying large proteins on the surface of filamentous phage artd enriching for 

altered receptor binding properties. A piasmid, phGH-M13glil was constructed that fuses the entire coding 
sequence of hGH to t\e carboxyl terminal domain of M13 gene III. Transcription of the fusion protein is under 
control of the lac promoter/operator sequence, and secretion is directed by the stil signal sequence. Phagemid 
particles are produced by infection with the "helper phage, M13K07, and particles displaying hGH can be 

1 5 enriched by binding to an affinity matrix containing the hGH receptor. The wild-type gene 111 {derived from the 
M13K07 phage) is diagramed by 4-5 copies of the multiple arrows on the tip of the phage, and the fusion protein 
(derived from the phagemid, phGH-M13glil) is indicated schematically by the folding diagram of hGH replacing 
the arrow head. 

FIGURE 2. Immunoblot Of whole phage partides Shows that hGH comlgrates with phage. Phagemid 

2 0 particles purified in a cesium chloride gradient were loaded into duplicate wells and electrophoresed through a 1 % 
agarose gel in 375 mM Tris, 40 mM glycine pH 9.6 buffer. The gel was soaked in transfer buffer (25 mM Tris, pH 
8.3, 200 mM ^ycine, 20% methanol) containing 2% SDS and 2% P-mercaptoethanol for 2 hours, then rinsed in 
transfer buffer for 6 hours. The proteins in the gel were then electroblotted onto immobilon membranes 
(Millipore). The membrane containing one set of samples was stained wtSn Coomassie blue to show the position of 

25 the phage proteins (A). The duplicate membrane was immuno-stained for hGH by reacting the membrane with 
polyclonal rabbit anti-hGH antibodies followed by reaction with horseradish peroxidase conjugated goat anti- 
rabbit IgG antibodies (B). Lane 1 contains the M13K07 parent phage and is visible only m the Coomassie blue 
stained membrane, since it lacks hGH. Lanes 2 and 3 contain separate preparations of the hormone phagemid 
particles which is visible both by Coomassie and hGH immuno-staining. The difference m migration distance 

30 between tie parent M13K07 phage and hormone phagemid partides reflects me different size genomes that are 
packaged within (8.7 kb vs. 5.1 kb, respectively). 

FIGURE 3. Summary diagram of steps in the selection process for an hGH-phage library randomized at 
codons 172, 174, 176, and 178. The template molecules, pH0415, containing a unkjue Kpnl restriction site and the 
hGH(Rl78G,l179T) gene was mutagentzed as described in the text and electrotransformed into £ coli strain 

35 WJM101 to obtain the initial phagemid library. Library 1. An aliquot (approximately 2%) from Library 1 was used 
directly in an initial selection round as described in the text to yield Library 1G. Meanwhile, double-stranded DMA 
(dsDNA) was prepared from Library I, digested with restriction enzyme Kpnl to eliminate template background, 
and electrotransformed into WJM101 to yield Library 2. Subsequent rounds of selection (or Kpnl digestion, 
shaded boxes) followed by phagemki propagation were canied out as indicated by the arrows, according to the 



procedure desaibed in the text. Four independent clones from Library 4G^ and four independent clones from 
Library 5G^ were sequenced by dideoxy sequencing. All of these ck>nes had the identical DNA sequence, 
corresponding to the hGH mutant (Glu 174 Ser. Phe 176 Tyr). 

RGURE 4. Structural model of hGH derived from a 2.8 A folding diagram of porcine growth hormone 
determined cryslallographically. Location of residues in hGH that strongly modulate its binding to the hGH- 
binding protein are within the shaded circle. Alanine substitutions that cause a greater than tenfold 
reduction{#), a four- to tenfold reduction {•), or increase (O), or a two- to fourfold reduction (•). in binding 
affinity are indicated. Helical wheel projections in the regions of a-heiix reveal their amphipathic quality. 
Blackened, shaded, or nonshaded residues are charged, polar, or nonpoiar, respectively. In helix-4 the most 
important residues for mutation are on the hydrophiiic face. 

FIGURE 5. Amino acid substitutions at portions 172, 174, 176 and 178 of hGH (The notation, e.g. 
KSYR, denotes hGH mutant 172K/174S/176Y/178R.) found after sequencing a number of clones from rounds 1 
and 3 of the selection process for the pathways indicated (hGH eiution; Glycine elutton; or Glycine elution after 
pre-adsorption). Non-functionai sequences (i.e. vector background, or other prematurely terminated and/or 
frame-shifted mutants) are shown as "NP. FijKtional sequences whk:h contained a non-sitent, spurious mutation 
(i.e. out&de the set of target residues) are marked with a ■+'. Protein sequences which appeared more than once 
among all the sequenced clones, but with different DNA sequences, are marked with a "#". Protein sequences 
wNch appeared more than once among tfie sequenced ctones and with the same DNA sec^ience are marked with a 

Note that after three rounds of selection, 2 different contaminating sequences were found; these clones did 
not correspond to cassette mutants, but to previously constructed hormone phage. The pS0643 contaminant 
corresponds to wild-type hGH-phage (hGH "KEFR"). The pH0457 contaminant, which dominates the third- 
round glycine-selected poo! of phage, corresponds to a previously identified mutant of hGH, "KSYR." The 
amplification of these contaminants emphasizes the ability of the hormone-phage selection process to select for 
rarely occuning mutants. The convergence of sequences is also striking in all three pathways: R or K occurs most 
often at positions 172 and 178; Y or F occurs most often at position 176; and S, T, A, and other residues occur at 
position 174. 

FIGURE 6. Sequences from phage selected on hPRLbp-beads in the presence of zinc. The notation is 
as described in Rgure. 5. Here, the convergence of sequences is not predictable, but there appears to be a bias 
towards hydrophobic sequences under the most stringent (Glycine) selection conditions; L ,W and P re^dues are 
frequently found in this pool. 

FIGURE 7. Sequences from phage selected on hPRLbp-beads in the absence of zinc. The notation is as 
described in Rgure 5. In contrast to the sequences of Figure. 6, these sequences appear more hydrophiiic. After 
4 rounds of selection using hGH elutfon, two ctones (ANHQ, and TLDT/171 V) dominate the pool. 

FIGURE 8. Sequences fnsm phage selected on blank beads. The notation is as described in Rg. 5. After 
three rounds of selection with glycine elution, no siblings were observed and a background level of non-functional 
sequences remained. 

FIGURE 9. Construction of phagemid fl ori from pH0415. This vector for cassette mutagenesis and 
expression of the hGH-gene til fusion protein was constructed as follows. Plasmid pS0643 was constructed by 
oligonucieotide-directed mutagenesis of pS0132, which contains pBR322 and fl origins of replication and 
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expresses an hGH-Qene III fusion protein (hGH residues 1-191, followed by a singte Gly residue, fused to Pro-198 
of gene III) under the control of the E. colt phoA promoter. Mutagenesis was can-ied out with the oligonucleotide 
5'-GGC-AGC-TGT-GGC-T TC-TAG-A GT-GGC-GGC-GGC>TCT-GGT-3'. which Introduced a ^iM site 
(underlined) and an amber stop codon (TAG) following Phe-191 of hGH. 
5 FIGURE ia A. Diagram of plasmid pDH188 insert containing the DNA encoding the light chain and 

heavy chain (variable and constant domain 1 ) of the Fab humanized antibody directed to the HER-2 receptor. Vl 
and Vh are the variable regions for the light and heavy chains, respectively. Ck is the constant region of the human 
kappa light chain. CH1 gi is the first constant region of the human gamma 1 chain. Both coding regions start with 
the bacterial st li signal sequence. B. A schematic diagram of the entire piasma pDH188 containing the insert 
1 0 described in 5A, After transformation of the plasmid into £ coli SR101 cells and the addition of helper phage, 
the plasmid is packaged into phage partictes. Some of these particles display the Fab-p HI fusion (where p 111 is 
the protein encoded by the M13 gene 11! DNA). Tt^ segments in the plasmid figure con-espond to the insert shown 
inSA. 

FIGURE 11 A through C are collectively referred to here as Rgure 11. The nucteotide (Seq. ID Ho. 25) 
1 5 sequence of the DNA encoding the 4D5 Fab nnolecuie expressed on the phagemid surface. The amino acid sequence 
of the light chain is also shovm (Seq. ID No. 26), as is the amino add sequence of the heavy chain p III fusion (Seq. ID 
No. 27). 

FIGURE 1Z Enrichment of wild-type 4D5 Fab pf^gemid from variant Fab P^^gemid. Mixtures of wild- 
type phagemid and variant 4D5 Fab phagemid in a ratio of 1:1,000 were selected on plates coated with the extra- 

20 cellular domain protein of the HER-2 receptor. After each round of selection, a portion of the eluted phagemid 
were infected into £ co// and plasmid DNA was prepared. This plasmid DNA was then digested with Eco RV and 
Pst !, separated on a 5% polyacrylamide gel, and stained with ethidium bromide. The bands were visualized under 
UV light. The bands due to the wild-type and variant plasmids are marked with arrows. The first round of 
selection was eluted only under acid conditions; subsequent rounds v^re eluted with either an acid elution (left 

25 side of Figure) or v^dth a humanized 4D5 antibody wash step prior to acid elution (right side of Figure) using 
methods described in Example VIII. Three variant 4D5 Fab rnolecties were made: H91 A (amino acid histtdine at 
position 91 on the Vl chain mutated to alanine; indicated as 'A' lanes in Figure), Y49A (amino acid tyrosine at 
position 49 on the Vl chain mutated to alanine; indicated as *B' lanes in the Rgure), and Y92A (amino add tyrosine 
at position 92 on the Vl chain mutated to alanine; indicated as 'C lanes in the Rgure). Amino add poation 

3 0 numbering is according to Kabat et al., (Sequences of proteins of immunobgical interest, 4th ed., U.S. Dept of 
Health and Human Sendees, Public Health Servce, f^ri. Institute of Health, Bethesda, MD [1987]). 

FIGURE 13. The Scatchard analysis of the RIA affinity determination described in Experimental 
Protocols is shown here. The amount of labeled ECD antigen that is bound is shown on the x-axis while the amount 
that is bound divided by the amount that is free is shown on the y-axis. The slope of the line indicates the Ka; the 

35 calculated is 1/Ka. 
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DETAILED DESCRIPTON OF THE WVENTION 

The following discussion will be best understood by referring to Rgure 1 . In its simplest form, the 
mettxxi of the instant invention comprises a method for selecting novel binding polypeptides, such as protein 
ligands, having a desired, usually high, affinity for a target molecule from a library of structurally related binding 
5 polypeptides. The library of stnxturaily related polypeptides, fused to a phage coat protein, is produced by 
mutagenesis and, preferably, a single copy of each related polypeptide is displayed on the surface of a phagemid 
particle containing DNA encoding that polypeptide. These phagemid particies are then contacted with a target 
molecule and those particles having the highest affinity for the target are separated from those of lower affinity. 
The high affinity binders are then amplified by infection of a bacterial host arxj the competitive binding step is 
1 0 repeated. This process is reiterated until polypeptides of the desired affinity are obtained. 

The novel binding polypeptides or ligands produced by the method of this invention are useful per se as 
diagnostics or therapeutics ( eg. agonists or antagonists) used in treatment of biological organisms. Structural 
analysis of the selected polypeptides may also be used to facilitate rational dajg design. 

By "binding polypeptide* as used herein is meant any polypeptide that binds with a selectable affinity to 
15 a target molecule. Preferably tf^e polypeptide will be a protein that most preferably contains more than about 
100 amino acid residues. Typically the polypeptide will be a hormone or an antibody or a fragment thereof. 

By "high affinity* as used herein is meant an affinity constant (K^ ) of <10"5 M and preferably <10'^M 
under physiological conditions. 

By "target molecule" as used herein is meant any molecule, not necessarily a protein, for which it is 
20 desirable to produce a ligand. Preferably, however, the target will be a protein and most preferably the target 
will be a receptor, such as a hormone receptor. 

By "humanized antibody* as used herein is meant an antibody in which tf>e complementarity-determining 
regions (CDRs) of a mouse or other non-human antibody are grafted onto a human antibody framework. By human 
antibody framewori^ is meant the entire human antibody excluding the CDRs. 

25 L Choice of Polypeptides for Display on the Surface of a Phage 

The first step in the method of this invention is to choose a polypeptide having rigid secondary 
structure exposed to the surface of the polypeptide for display on the surface of a phage. 

By •polypeptide" as used herein is meant any molecule whose expression can be directed by a specific 
DNA sequence. The polypeptides of this invention may comprise more than one subunit, where each subunit is 
3 0 encoded by a separate DNA sequence. 

By "rigid secondary structure* as used herein is meant any polypeptide segment exhibiting a regular 
repeated structure such as is found in; a-helices, 3io helices, jc-helices, parallel and antiparallel p-sheets, and 
reverse turns. Certain "non-ordered" structures that lack recognizable geometric order are also included in the 
definition of rigid secondary structure provided they form a domain or "patch" of amino acid residues capable of 
35 interaction with a target and that the overall shape of tfie structure is not destroyed by replacement of an amino 
acid within the structure . It is believed tiiat some non-ordered structures are combinations of reverse turns. The 
geometry of these rigid secondary structures is well defined by <(> and y torsonal angles about the a-carbons of 
the peptide "backbone". 
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The requirement that the secondary structure be exposed to the surtace of the polypeptide is to 
provide a domain or "patch' of amino acid residues that can be exposed to and bind with a target molecule. It is 
primarily these amino add residues that are replaced by mutagenesis that form the "library" of structurally 
related (mutant) binding polypeptides tf^t are displayed on the surface of the phage and from which novel 
5 polypeptide iigands are selected. Mutagenesis or replacement of amino ackJ residues directed toward the interior 
of the polypeptide is generally avoided so that the overall slructwe of the rigid secondary structure is preserved. 
Some replacement of amino acids on the interior region of the rigid secondary stnx:tures, especially with 
hydrophobic amino acid residues, may be tolerated since these conservative substitutions are unlikely to distort 
the overall structure of the polypeptide. 

1 0 Repeated cycles of 'polypeptide" selection are used to select for higher and higher affinity binding by 

the phagemid selection of multiple amino add changes which are selected by multiple selection cycles. Following a 
first round of phagemid selection, involving a first region or selection of amino adds in the ligand polypeptide, 
additional rounds of phagemid selection in other regions or amino adds of the ligand polypeptide are conducted. 
The cydes of phagemid selection are repeated until the desired affinity properties of the ligand polypeptide are 

1 5 achieved. To illustrate this process. Example VIII phagemid selection of hGH was conducted in cycles. In the first 
cyde hGH amino adds 172, 174, 176 and 178 were mutated and phagemid selected. In a second cycle hGH amino 
adds 167, 171 , 175 and 1 79 were phagemid selected In a third cyde hGH amino adds 10, 14, 18 and 21 were 
phagemid selected. Optimum amino acid changes from a previous cycle may be irxx)rporated into the polypeptide 
before the next cyde of selection. For example, hGH amino adds substitution 174 (serine) arvc 76 (tyrosine) 

20 were incorporated into the hGH before the phagemid selection of hGH amino acids 167, 171, i and 179. 

From the forgoing it will be appreciated that the amino add residues that form the binding domain of 
the polypeptide will not be sequentially linked and may reside on different subunits of the polypeptide. That is, 
the binding domain tracks with the particular secondary structure at the binding site and not the primary 
structure. Thus, generally, mutations will be introduced into codons encoding amino acids within a particular 

25 secondary structure at sites directed away from the interior of the polypeptide so that they will have the 

potential to interact with the target. By way of illustration. Figure 2 shows the location of residues in hGH that 
are known to strongly modulate its binding to the hGH-binding protein (Cunningham et ai, Sdence 247:1 46 1 - 
1465 [1990]). Thus representative sites suitable for mutagenesis would indude residues 172, 174, 176, and 178 
on heIix-4, as well as residue 64 located in a "non-ordered" secondary structure. 

3 0 There is no requirement that the polypeptide chosen as a ligand to a target normally bind to that target. 

Thus, for example, a glycoprotein hormone such as TSH can be chosen as a ligand for the FSH receptor and a 
library of mutant TSH molecules are employed in the meM of this invention to produce novel drug candidates. 

This invention thus contemplates any polypeptide that binds to a target molecule, and indudes 
antibodies. Preferred polypeptides are those that have pharmaceutical utility. More preferred polypeptides 

3 5 indude; a growth hormone, induding human growth hormone, des-N-metiiionyi hunan growth homwne, and bovine 
growth hormone; parathyroid hormone; thyroid stimulating homione; thyroxine; insulin A-chain; insulin B-chain; 
proinsulin; follicle stimulating homione; caldtonin; leulinizing hormone; glucagon; factor Vtll; an antibody; lung 
surfactant; a plasminogen activator, such as urokinase or human tissue-type plasminogen activator (t-PA); 
bombesin; factor IX, thrombin; hemopoietic growth factor; tumor necrosis factor-alpha and -beta; enkephalioase; a 
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seam albumin such as human seam albumin; miitenan^nhibiting substance; relaxin A-chain; relaxin B-chain; 
prorelaxin; mouse gonaciotropin-associated peptide; a microbial protein, such as betalactamase; tissue factor 
protein; inhtbin; activin; vascular endothelial growth factor; receptors for hormones or growth factors; integrin; 
thrombopoietin; protein A or D; rheumatoid factors; nerve growth factor such as NGF-p; platelet-derived growth 
5 factor; fibroblast growth factor such as aFGF and bPGF; epidermal growth factor; transforming growth factor 
(TGF) such as TGF-aipha and TGF-beta; insulin-like growth factor-l and -II; insulin-like growth factor binding 
proteins; CD-4; DNase; latency associated peptide; erythropoietin; osteoinductive factors; an interferon such as 
Interferon-alpha, -beta, and -gamma; colony stimulating factors <CSFs), e.g.. M-CSF, GM-CSF, and G-CSF; 
interleukins (ILs), e.g., IL-1, tL-2, IL-3, IL-4, etc.; superoxide dismutase; decay accelerating factor; atrial 

1 0 natriuretic peptides A, B or C; viral antigen such as. for exampte, a portion of the HIV envelope; immunogiobulins; 
and fragments of any of the above-listed polypeptides. In addition, one or more predetermined amino add 
residues on the polypeptide may be substituted, Inserted, or deleted, for example, to produce products with 
improved biological properties. Further, fragments of these polypeptides, especially biologically active 
fragments, are included. Yet more preferred polypeptides of this invention are human growth hormone , and 

1 5 atrial naturetic peptides A, B, and C. endotoxin, subtilisin, trypsin and other serine proteases. 

Still more preferred are polypeptide hormones that can be defined as any amino acid sequence produced 
in a first cell that binds specifically to a receptor on the same cell type (autocrine hormones) or a second cell type 
(non-autocrine) and causes a physiotogicai response characteristic of the receptor-bearing cell. Among such 
polypeptide hormones are cytokines, lymphokines, neurotrophic hormones and adenohypophyseai polypeptide 

2 0 hormones such as growth hormone, prolactin, placental lactogen, luteinizing hormone, follicle-stimulating hormone, 

thyrotropin, chorionic gonadotropin, corticotropin, a or p-melanocyte-stimulating hormone, p-lipotropin, y- 
lipotropin and the endorphins; hypothalmc release-inhibiting hormones such as corticotropin-reiease factor, 
growth hormone release-inhibiting hormone, growth hormone-release factor; and other polypeptide hormones such 
as atrial natriuretic peptides A, B or C. 
25 R. QWainihQ a Rrst Gene (Gene 1) encoding th^ desired poivpeptide 

The gene encoding the desired polypeptkte (i.e., a poIypeptkJe with a rigid secondary structure) can be 
obtained by methods known in the art (see generally, Sambrook et al, . Molecular Biology: A Laboratory Manual . 
Cold Spring Harbor Press, Cold Spring Harbor, New York [1989]). If the sequence of the gene is known, the 
DNA encoding the gene may be chemfcally synthesized (MenHeW, J. Am. Chem. Soc, ,. 85 2149 [1963]). If the 

3 0 sequence of the gene is not knovim, or If the gene has not previously been isolated, it may be ctoned from a cDNA 

library (made from RNA obtained from a suitable tissue in which the desired gene Is expressed) or from a suitable 
gerromk; DNA library. The gene is then isolated using an appropriate probe. For cDNA libraries, suitable probes 
include monoclonal or polyclonal antibodies (provkjed that the cDNA library is an expressbn library), 
oligonudeot'des, and complementary or homologous cDNAs or fragments thereof. The probes that may be used to 
35 isolate the gene of interest from genomk; DNA libraries include cDNAs or fragments thereof that encode the same 
or a similar gene, homok>gous genome DNAs or DNA fragments, and oligonucleotides. Screening the cDNA or 
gerromic library with the selected probe is conducted using standard procedures as described in chapters 10-12 
of Sambrook etal., supra. 
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An alternative means to isolating the gene encoding the protein of interest is to use polymerase chain 
reaction methodology (PGR) as described in section 14 of Sambrook etal., supra. This method requires the use 
of oligonucleotides that will hybridize to the gene of interest; thus, at least some of the DNA sequence for this 
gene must be known in order to ger^rate the oiigonucieotides. 
5 After the gene has been isolated, it may be inserted into a suitable vector (preferably a plasmid) for 

amplification, as described generally in Sambrook et al., supra. 
i. Constructing Reollcabte Exoression Vectors 

White several types of vectors are available and may be used to practice this invention, plasmid vectors 
are the prefen^d vectors for use herein, as they may be constnjcted with relative ease, and can be readily 
1 0 amplified. Plasmid vectors generally contain a variety of components including promoters, signal sequences, 
phenotypic setection genes, origin of JBplication sites, and other necessary components as are known to those of 
ordinary skill in the art. 

Promoters most commonly used in ipfoksq^ vectorsindude theJac Z promoter system, the alkaline 
phosphatase cbQ A promoter, the bacter^^p^noge XPL promrrtBr ^ temperature sensitive promoter), the lac 
1 5 promoter (a hybrid lip-lac promoter ^t is regulated by the lat repressor), the fiyptophan promoter, and the 
bacteriophage 17 promoter. For ^er^eral descryn^ions of promoters, see section 17 of Sambrook ei ah supra , 
While these are the most commonly used promctets, other suiteibte nacrobial promoters may be used as well. 

Preferred promoters for practicing this invertiicfn are tho^ thai can be tightly regulated such that 
expression of the fusion gene can be controlled. It is believed thaJ«ie profafem that went unrecognized in the 

2 0 prior art was that display of multiple copies of the fusion protein on the surface d fne phagemid particle lead to 

multipoint attachment of the phageyiud m$)i ihe target It is believed this effect, referred to as the "chelate 
effect", results in selection of false '^igh afrtT%" polypeptides when multiple copies of the fusion protein are 
displayed on the phagemid particte in close proximity to ot« anofier so that the target was "chelated". When 
multipoint attachment occurs, the effective or apparent Kd may be as high as the product of the individual Kds 

25 for each copy of the displayed fusion protein. This effect may be the reason Cwiria and coworkers supra were 
unabie to separate moderate affinity peptides from higher affinity peptides. 

It has been discovered that by tightly regulating expression of the fusion protein so that no more than a 
minor amount, i.e. fewer than about 1%, of the phagemid partictes contain multiple copies of the fusion protein the 
"chelate effect* is overcome allowing proper selection of high affinity polypeptides. Thus, depending on the 

30 promoter, culturing conditions of the host are adjusted to maximize the number of phagemid particles containing a 
single copy of the fusion protein and minimize the number of phagemkJ particles containing multiple copies of the 
fusion protein. 

Preferred promoters used to practice this invention are the lac Z promoter and the ^ A promoter. 
The lac Z promoter is regulated by the lac repressor protein lac i, and thus transcription of the fusion gene can be 

3 5 controlled by manipulation of the level of the lac repressor protein. By way of illustration, the phagemid containing 

the lac Z promoter is grown in a cell strain that contains a copy of the lac i repressor gene, a repressor for the 
lac Z promoter. Exemplary cell strains containing the lac i gene include JM 101 and XL1-b!ue. In the alternative, 
the host ceil can be cotransfected with a plasmid containing both the repressor lac i and the lac Z promotor. 
Occasionally both of the above techniques are used simultaneously, that is, phagmide particles containing the lac Z 
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promoter are grown in cell strains containing the laQ i gene and the ceH strains are cotranslected with a plasmid 
cxMTtaining both the lac Z and lac i genes, ^4ormally when one wishes to express a gene, to the transfected host 
above one woukj add an inducer such as isopropylthiogalactoside (IPTG). in the present invention however, this 
step is omitted to (a) minimize the expression of the gene flf fusion protein thereby minimizing the copy number 
5 (i.e. the number of gene Hi fusions per phagemid number) and to (b) prevent poor or improper packaging of the 
phagemid caused by inducers such as IPTG even at low concentrations. Typicaliy, when no inducer Is added, the 
number of fusion proteins per phagemid particle is about 0.1 (number of bulk fusion proteins/number of phagemid 
partides). The most pretend promoter used to practice this invention is i^bfi A. This promoter is believed to be 
regulated by the level of inorganc phosphate in the cell where the phosphate acts to down-regulate the activity of 

1 0 the promoter. Thus, by depleting cells of phosphate, the activity of the promoter can be increased. The desired 
result is acNeved by growing cells in a phosphate enriched medium such as 2YT or LB thereby controlling the 
expression of the gene ill fusion. 

One other usef J component of vectors used to practice this invention is a signal sec^nce. This sequence 
is typically located immediately 5' to the gene encoding the fusion protein, and will thus be transcribed at the amino 

1 5 terminus of the fusk)n protein. However, in certain cases, the signal sequence has been demonstrated to be k>cated 
at positions other 5' to the gene encoding the protein to be seaeted. This sequence targets the protein to which 
it is attached across the inner membrane of the bacterial cell. The DNA encoding the signal sequence may be 
obtained as a restriction endonuclease fragment from any gene encoding a protein that has a signal sequence. 
Suitable prokaryotic signal sequences may be obtained from genes encoding, for example. LamB or OmpF (Wong 

20 et ai, fi£n£. 68:193 [1983]). MalE, PhoA and other genes. A preferred prokaryotic signal sequence for practicing 
this invention is the £ coli heat-stabie enterotoxin II (STll) signal sequence as described by Chang et al. , Gene . 
55: 189 [1987]. 

Another useful component of the vectors used to practrce this invention is phenotypic selection genes. 
Typical phenotypic selection genes are those encoding proteins that confer antibiotic resistance upon the host cell. 
25 By way of illustration, the ampciilin resistance gene (amiS). and the tetracycline resistance gene (lei) are readily 
employed for this purpose. 

Constnjction of suitable vectors comprising the aforementioned components as well as the gene encoding 
the desired polypeptide (gene 1) are prepared using standard recombinant DNA procedures as described in 
Sambrook et al supra. Isolated DNA fragments to be combined to form the vector are cleaved, tailored, and 
30 ligated together In a specific order and orientation to generate the desired vector. 

The DNA is cleaved using the appropriate restriction enzyme or enzymes in a suitable buffer. In general, 
about 0.2-1 Jig of plasmid or DNA fragments is used with about 1-2 units of the appropriate restriction enzyme 
in about 20 nl of buffer solution. Appropriate buffers, DNA concentrations, and incubation times and 
temperatures are specified by the manufacturers of the restriction enzymes. Generally, incubation times of about 
35 one or two hours at 37'C are adequate, althougfi several enzymes require higher temperatures. After incubation, 
the enzymes and other contaminants are removed by extractfon of the digestion solution with a mixture of phenol 
and chlorofomi, and the DNA is recovered from the aqueous fraction by precipitation vwth ethanol. 

To ligate the DNA fragments together to form a functional vector, the ends of the DNA fragments 
must be compatible with each other. In some cases, the ends will be directly compatible after endonuclease 



digestion. However, rt may be necessary to first convert the sticky ends commonly produced by endonuciease 
digestion to blunt ends to make them compatible for ligation. To blunt the ends, the DNA is treated in a suitable 
buffer for at least 15 minutes at 15'C with 10 units of of the Klenow fragment of DNA polymerase I (Klenow) in 
the presence of the four deoxynudeotide triphosphates. The DNA is then purified by phenol-chloroform 
5 extraction and ethanol precipitatk)n. 

The cleaved DNA fragments may be size-separated and selected using DNA gel electrophoresis. The 
DNA may be electrophoresed through either an agarose or a polyacryiamkte matrix. The selection of the matrix 
will depend on the size of the DNA fragments to be separated. After electrophoresis, the DNA is extracted from 
the matrix by electroelution, or, if low-melting agarose has been used as the matrix, by melting the agarose and 

1 0 extracting the DNA from it, as described in sections 6.30-6.33 of Sambrook et at., supra. 

The DNA fragments that are to be iigated together (previously digested with the appropriate 
restrction enzymes such that the ends of each fragment to be Iigated are compatible) are put in solution in about 
equimolar amounts. The solution will also contain ATP, ligase buffer and a ligase such as T4 DNA ligase at about 
10 units per 0.5 \iq of DNA. If the DNA fragment is to be Iigated into a vector, the vector is at first linearized by 

1 5 cutting with the appropriate restriction endonuclease(s). The linearized vector is then treated with alkaline 
phosphatase or calf intestinal phosphatase. The phosphatasing prevents self-ligatk)n of the vector during the 
ligation step. 

After ligation, the vector with the foreign gene now inserted is transfonried into a suitable host cell. 
Prokaryotes are the preferred host cells for this invention. Suitable prokaryotk; host cells include E coll strain 

20 JM101 , E coli K12 strain 294 (ATCC number 31,446), E coli strain W31 10 (ATCC number 27,325). E coH 
XI 776 (ATCC number 31,537), E coli XL-IBiue (stratagene), and E coli B; however many other strains of E 
CO//, such as HB101 , NM522, NM538. NM539, and many other species and genera of prokaryotes may be used as 
well. In addition to the E coli strains listed above, bacilli such as Bacillus subtilis. other enterobacteriaceae such as 
Salmonella typhimijrium or Serratia mar(^^n^, and various Pseudomonas species may all be used as hosts. 

25 Transformation of prokaryotic cells is readily accomplished using the calcium chloride method as 

described in section 1 .82 of Sambrook et aL, supra. Alternatively, electroporation (Neumann et al., EMBQ J.. 
1 :841 [1 982]) may be used to transform these cells. The transformed cells are selected by growth on an 
antibiotic, commonly tetracycline (tet) or ampicillin (amp), to whk:h they are rendetBd resistant due to the 
presence of tet and/or amp resistance genes on the vector. 

30 After selection of the transfonned cells, these cells are grown in culture and the plasmid DNA (or other 

vector vwth the foreign gene inserted) is then Isolated. Piasmki DNA can be isolated using methods known in the 
an Two suitable methods are the small scale preparation of DNA and the large-scale preparation of DNA as 
described in secttons 1 .25-1 .33 of Sambrook et a!., supra. The isolated DNA can be purified by methods known in 
the art such as that described in section 1 .40 of Sambrook etaL, supra. This purified plasmid DNA is then 

3 5 analyzed by restrk;tbn mapping and/or DNA sequencing. DNA sequencing is generally performed by either the 
method of Messing era/. NudgiC Aci^fe Rfifr., 9309 [1981] or by the method of Maxam etai Meth. Enzvmol . 6S- 
499 [1980]. 
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IV. j^D^B^ 

This invention contemplates fusing the gene enclosing the desired polypeptide (gene 1 ) to a second gene 
(gene 2) such that a fusion protein is generated during transcription. Gene 2 is typically a coat protein gene of a 
phage, and preferably it is the phage M13 gene til coat protein, or a fragment thereof. Fusion of genes 1 and 2 may 
5 be accomplished by inserting gene 2 into a particular site on a plasmid that contains gene 1 , or by inserting gene 1 
into a particular site on a plasmid that contains gene 2. 

Insertion of a gene into a plasmid requires that the plasmid be cut at the precise location that the gene is 
to be inserted. Thus, there must be a restriction endonudease site at this location (preferably a unique site such 
that the plasmid will only be cut at a single location during restriction endonudease digestion). The plasmid is 

1 0 digested, phosphatased, and purified as described above. The gene is then inserted into this linearized ptasmid by 
ligating the two DNAs together, ligation can be accomplished if the ends of the plasmid are compatible with the 
ends of the gene to be inserted. If the restriction enzymes are used to cut the plasmid and isolate the gene to be 
inserted create blunt ends or compatible sticky ends, the DNAs can be ligated together directly using a ligase 
such as bacteriophage T4 DNA ligase and incubating the mixture at 16'C for 1-4 hours in the presence of ATP 

1 5 and ligase buffer as described in section 1 .68 of Sambrooic eta!., succa- If the ends are not compatible, they must 
first be made blunt by ustr^ the fOenow fragment of DNA polymerase I or bacteriophage T4 DNA polymerase, 
both of which require the four deoxyribonudeotide triphosphates to fill-in overhanging single-stranded ends of 
the digested DNA. Alternatively, tt^e ends may be blunted using a nuclease such as nudease SI or mung-bean 
nudease, both of which function by cutting back the overhanging single strands of DNA. The DNA is then 

2 0 religated using a ligase as described above. In some cases, it may not be possible to blunt the ends of the gene to 
be inserted, as the reading frame of the coding region will be altered. To overcome this problem, oligonucleotide 
linkers may be used. The linkers serve as a brkige to connect the plasmid to the gene to be inserted. These linkers 
can be made synthetically as double stranded or single stranded DNA using standard methods. The linkers have 
one end that is compatible with the ends of the gene to be inserted; the linkers are first ligated to this gene using 

2 5 ligation methods described above. The other end of the linkers is designed to be compatible with tiie plasmid tor 

ligation. In designing the linkers, care must be taken to not destroy the reading frame of the gene to be inserted 
or the reading frame of the gene contained on the plasmid In some cases, it may be necessary to design the linkers 
such that they code for part of an amino acid, or such that they code for one or more amino adds. 

Between gene 1 and gene 2, DNA encoding a teonination codon may be inserted, such termination codons 
30 are UAG( amber), UAA (ocher) and UGA (opei). (Microbiobgy, Davis et al. Harper & Row, New Yori^, 1980, 

pages 237, 245-47 and 274). The terminaton codon expressed in a wiki type host cell results in the synthesis of 
the gene 1 protein product without the gene 2 protein attached. However, growth \n a suppressor host cell 
results in the synthesis of detectable quantities of fused protein. Such suppressor host cells contain a tRNA 
modified to insert an amino add in the terminatton codon positk)n of the mRNA thereby resulting in prodxtion of 

3 5 detectible amounts of the fusion protein. Such suppressor host cells are vi«ll known and described, such as E.coli 

suppressor strain (Bullock et al.. BioTechniques 5. 376-379 [19871). Any acceptable method may be used to 
place such a termination codon into the mRNA encoding the fusion polypeptide. 

The suppresstble codon may be inserted between the first gene encoding a polypeptide, and a second 
gene encoding at least a portion of a phage coal protein. Alternatively, the suppressive termination codon may be 



inserted adjacent to the fusion site by replacing the last amino ackJ Iriptet in the polypeptide or the first amino 
acid in the phage coat protein. When the phagemid containing the suppressiWe cocJon is grown in a suppressor 
host cell, it results in the detectable production of a fusion polypeptide containing the polypeptide and the coat 
protein. When the phagemid is grown in a non-suppressor host ceil, the polypeptide is synthesized substantially 
5 without fusion to the phage coat protein due to termination at the inserted suppressiWe triplet encoding UAG, 
UAA or UGA. In the non-suppressor cell the polypeptide is synthesized and secreted from the host celt due to the 
absence of the fused phage coat protein which othenvise anchored it to the host cell. 
V. AlteratlQn(mutation) of Gene 1 at Selected Positions 

Gene 1 , encoding the desired polypeptide, may be altered at one or more selected codons. An alteration 

10 is defined as a substitu&>n, deletion, or insertion of one or more codons in the gene encoding the polypeptide that 
results in a change in the amino acid sequence of the polypeptide as compared with the unaltered or native 
sequence of the same polypeptide. Preferably, the alterations will be by substitution of at least one amino acid 
with any other amino acid in one or more regions of the molecule. The alterations may be produced be a variety of 
methods known in the art These methods include but are not limited to oligonucteotide-mediated mutagenesis and 

1 5 cassette mutagenesis. 

A. Qiioonudeotide-Mediated Mutagenesfs 

Oligonucleotide -mediated mutagenesis is preferred method for preparing substitution, deletion, and 
insertion variants of gene 1 . This technique is well known in the art as described by Zoller et al. Nucleic Acids Res. 
Jfi: 6487-6504 [1987]. Briefly, gene 1 is altered by hybridizing an oligonucleotide encoding the desired mutation 

20 to a DNA template, where the template is the single-stranded form of the plasmid containing the unaltered or 
native DNA sequence of gene 1 . After hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template will thus incorporate the oligonucieotide primer, and will code for the 
selected alteration in gene 1 . 

Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal oligonucleotide will 

25 have 12 to 15 nucleotides that are completely complementary to the template on either side of the nucieotide{s) 
coding for the mutation. This ensures that the oligonucleotide will hybridize properly to the single-stranded DNA 
template molecule. The oligonucleotides are readily synthesized using techniques known in the art such as that 
described by Grea etaL Proc. Natl. Acad Sr^ n<;/\, 75: 5765 [1978]. 

The DNA template can only be generated by those vectors that are either derived from bacteriophage 

30 M13 vectors (the commercially available M13mp18 and M13mp19 vectors are suitable), or those vectors that 
contain a single-stranded phage origin of replication as described by Viera et aJ. Meth. EnzvmoL. 153: 3 [1 987]. 
Thus, the DNA that is to be mutated must be Inserted into one of these vectors in order to generate single- 
stranded template. Production of the single-stranded template is described In sections 4.21-4.41 of Sambrook 
etaL, supra, 

T"^> alter the native DNA sequence, the oligonucleotide is hybridized to the single stranded template 
under suitable hybridization conditions. A DNA polymerizing enzyme, usually the Klenow fragment of DNA 
polymerase I, is then added to synthesize the complementary strand of the template using the oligonucieo^de as a 
primer for synthesis. A heteroduplex molecule is thus formed such that one strand of DNA encodes the mutated 
form of gene 1 , and the other strand {the original template) encodes the native, unaltered sequence of gene 1. 



This heterodupiex molecule is then transformed into a siitabte host cell, usually a prokaryote such as E Coli 
JM101- After growing the cells, they are plated onto agarose plates and screened using the oligonucleotide primer 
radiolabelled with 32-Phosphate to identify the bacterial colonies that contain the mutated DNA. 

The method described immediately above may be modified such that a homoduplex molecule is created 
5 wherein botfi strands of the plasmid contain the mutation(s). The modifications are as follows: The single- 
stranded oligonucleotide is annealed to the single-stranded template as described above. A mixture of three 
deoxyribonudeotides, deoxyriboadenosine (dATP), deoxyriboguanosine (dGTP), and deoxyribothymldine (dTTP), 
is combined with a modified thio^xyribocytosine called dCTP-<aS) (which can be obtained from Amersham). 
This mixture is added to the template-oligonucieotide complex. Upon addition of DNA polymerase to this mixture, 

1 0 a strand of DNA identical to the template except for the mutated bases is generated. In addition, this new strand 
of DNA will contain dCTP-{aS) instead of dCTP, which serves to protect it from restriction endonudease 
digestion. After the template strand of the douWe-stranded heteroduplex is nicked with an appropriate 
restriction enzyme, the template strand can be digested with Exolli nuclease or another appropriate nuclease past 
the region that contains the stte(s) to be mutagenized. The reaction is then stopped to leave a molecule that is 

1 5 only partially single-stranded. A complete double-stranded DNA homoduplex is then formed using DNA 

polymerase tn the presence of all four deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homoduplex 
molecule can then be transformed Into a suitable host cell such as £ coli JM101 . as described above. 

Mutants with more than one amino acid to be substituted may be generated in one of several ways. If the 
amino acids are tocated close together in the polypeptide chain, they may be mutated simultaneously using one 

2 0 oligonucleotide that codes for all of the desired amino acid substitutions. If, tiowever, the amino acids are located 
some distance from each other (separated by more than about ten amino acids), it Is more difficult to generate a 
single oligonucleotide that encodes all of the desired changes. Instead, one of two alternative methods may be 
employed. 

In the first method, a separate oligonucleotkJe is generated for each amino acid to be substituted. The 
25 oligonucleotides are then annealed to the single-stranded template DNA simultaneously, and the second strand of 
DNA that is synthesized from the template will encode all of the desired amino acid substitutions. The alternative 
method involves two or more rounds of mutagenesis to produce the desired mutant. The first round is as 
described for the sir^le mutants: wild-type DNA is used for the template, an oligonucleotkJe encoding the first 
desired amino add substituton(s) is annealed to this template, and the heteroduplex DNA molecule is then 
30 generated. The second round of mutagenesis utilizes the mutated DNA produced in the first round of 

mutagenesis as the template. Thus, this template already contains one or more mutations. The oligonucleotide 
encoding the additfonal desired amino acki substitution{s) is then annealed to this template, and the resulting 
strand of DNA now encodes mutations from both the first and second rounds of mutagenesis. This resultant DNA 
can be used as a template In a third round of mutager«sis, and so on. 
35 B. C^ssgtteMutaqengsls 

This method is also a preferred method for preparing substitution, deletion, and insertbn variants of 
gene 1. The method is based on that described by Wells etal. Gene. 34:315 [1985].. The starting material is the 
plasmid (or other vector) comprising gene 1 , the gene to be mutated. The codon(s) in gene 1 to be mutated are 
identified. There must be a unk^ue restrictbn endonudease site on each side of the Wentified mutation site(s). If 
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no such restriction sites exist. lf)ey may be generated using the atwve-described oligonucieotide-mediated 
mutagenesis method to introduce them at appropriate locations in gene 1. After the restriction sites have been 
introduced into the plasmid, the piasmid is cut at these sites to linearize it. A double-stranded oligonucleotide 
encoding the sequence of the DNA between the restriction sites but containing the desired mutation{s) is 
5 synthesized using standard procedures. The two strands are synthesized separately and then hybridized 
together using standard techniques. This double-stranded oligonudeotide is refenred to as the cassette. This 
cassette is designed to have 3* and 5' ends that are compatible with the ends of the linearized plasmid, such that it 
can be directly iigated to the plasmid. This piasmid now contains the mutated DNA sequence of gene 1 . 

VI. Qbtainina DNA encodtno the desirpd protrin 

0 *n an alternative embodiment, this invention contemplates producbon of variants of a desired protein 

containing one or more subunlts. Each subunit is typically encoded by separate gene. Each gene encoding each 
subunit can be obtained by methods knoMvn the art (see, for example, Section II). in some instances, it may be 
necessary to obtain the gene encoding the various subunits using separate techniques selected from any of the 
methods described in Section 11. 
^ 5 When construcSng a replicaWe expression vector where the protein of interest contains more than one 

subunit, all subunits can be regulated by the same promoter, typically located 5" to the DNA encoding the subunits, 
or each may be regulated by separate promoter suitably oriented in the vector so that each promoter is operably 
linked to the DNA it is intended to regulate . Selection of promoters is carried out as described in Section IN 
above. 

constructing a replicable expression vector containing DNA encoding the protein of interest having 
multiple subunits, the reader is referred to Figure 10 where, by way of illustration, a vector is diagrammed 
showing DNA encoding each subunit of an antibody fragment. This figure shows that, generally, one of the 
subunits of the protein of interest wilt be fused to a phage coat protein such as Ml 3 gene 111. This gene fusion 
generally will contain its own signal sequence. A separate gene encodes the other subunit or subunits, and It is 

25 apparent that each subunit generally has its ovm signal sequence. Figure 10 also shows that a single promoter can 
regulate the expression of both subunits. Aitematively, each subunit may be independently regulated by a 
different promoter. The protein of interest subunit-phage coat protein fusion construct can be made as 
described in Section IV above. 

When constructing a family of variants of the desired mulH-subunit protein, DNA encoding each subunit 

30 ffi the vector may mutated in one a more positions in each subunit When multi-subunit antibody variants are 
constructed, prefen-ed sites of mutagenesis con'espond to codons encoding amino add residues located in the 
complementarity-delemiining regions (CDR) of either the light chain, the heavy chain, or both chains. The CDRs 
are commonly referred to as the hypervariable regbns. Methods for mutagenizing DNA encoding each subunit of 
the protein of interest are conducted essentially as described in Section V above. 

35 

VII. PreDartno a Taroet Molecule and Bindfno v^rith Phao^ 

Target proteins, such as receptors, may be isolated from natural sources or prepared by recombinant 
methods by procedures known in the art. By my of illustration, glycoprotein hormone receptors may be prepared 
by the technique described by McFartand eiaL Science 245:494-499 [1989], nonglycosyiated forms expressed 
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in E CO// are ciescribed by Fuh et ai. J. Biol. Chem 265:31 1 1-31 15 [1990] Other receptors can be prepared by 
standard methods. 

The purified target protein may be attached to a suitable matrix such as agarose beads, acrylamide 
t)eads, glass beads, cellulose, varwus acrylic copolymers, hydroxyialkyi methacryiate gels, polyacrylic and 
5 polymethacryiic copolymers, nyion. neutral and ionic carriers, and the Ske. Attachment of the target protein to the 
matrix may be accomplished by methods described in Methods in Enzvmolopv , 44 [1 976], or by other means known 
in the art. 

After attachment of ^ target protein to the matrix, the immobilized target is contacted with the 
library of phagemkj partctes under conditions suitable for binding of at least a portwn of the phagemid particles 
1 0 with the immobilized target. Normaiiy, tt)e conditions, including pH, ionic strengtii, temperature and the like will 
mime physiotogica! conditions. 

Bound phagemid particles Cbinders") having high affinity for the immobilized target are separated 
from those having a low affinity (and thus do not bind to the target) by washing. Binders may be dissociated 
from the Immobilized target by a variety of methods. These methods include competitive dissociation using the 
1 5 wikJ-type ligand, altering pH and/or ionic strength, and methods known in the art. 

Suitable host cells are infected with the binders and helper phage, and the host cells are culUired under 
conditions suitable for amplification of the phagemid particles. The phagemki particles are then collected and the 
selection process is repeated one or more times until binders having the desired affinity for the target molecule 
are selected. 

2 0 Optionally the library of phagemid particles may be sequentially contacted with more than one 

immobilized target to improve selectivity for a particular target. For example, it is often the case that a ligand 
such as hGH has more than one natural receptor. In the case of hGH, both the grov^ hormone receptor and the 
prolactin receptor bind the hGH ligand. It may be desirable to improve the selectivity of hGH for the growth 
hormone receptor over the prolactin receptor. This can be achieved by first contacting the library of phagemid 

25 particles with immobilized prolactin receptor, eluting those wit*-- a low affinity {i.e. lower than wild type hGH( for 
the prolactin receptor and then contacting the tow affinity prolactin "binders" or non-binders with the 
immobilized grovrth honnone receptor, and selecting for high affinity growth hormone receptor binders. In this 
case an hGH mutant having a tower affinity for the prolactin receptor would have therapeutto utility even if the 
affinity for the growth hormone receptor were somewhat tower than that of wild type hGH. This same strategy 

30 may be employed to improve selectivity of a particular hormone or protein for its primary function receptor over 
its clearance receptor. 

In another embodiment of this inventton, an improved substrate amino acto sequence can be obtained. 
These may be useful for making better "cut sites' for protein linkers, or for better protease 
substrates/Inhibitors. In this embodiment, an Immobilizable molecule (e.g. hGH-receptor, btotin-avidin, or one 
3 5 capable of covalent linkage with a matrix) is fused to gene ill through a linker. The linker will preferably be from 3 
to 10 amino acids in length and will act as a substrate for a protease. A phagemid will be constructed as desaibed 
above where the DNA encoding the linker regton is rancfomly mutated to produce a randomized library of phagemid 
particles with different amino acid sequences at the linking site. The library of phagemid particles are then 
immobilized on a matrix and exposed to a desired protease. Phagemid particles having preferred or better 
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substrate amino acid sequences in the liner region for the desired protease will te eiuted, first producing an 
enriched pool of phagemid particies encoding preferred linkers. These phagemid partides are then cycled several 
more times to produce an enrich pool of particies encoding consense sequence(s) (see examptes XIII and XIV). 
Vltl. Growth Honrio ne Variants and Methods of ^ 
5 The cioned gene tor hGH has been expressed in a secreted form in Eschericha cola (Chang, C. H>,etal, 

[1987] figjQ£ and its DNA and amino acid sequence has been reported (Goeddel, et al. [1 979] Nature ggl . 
544; Gray et aL, [1985] fi£D£ ^, 247). The present invention describes novel hGH variants produced usj'ng the 
phagemid selection methods. Human growth hormone variants containing subsfitutions at positions 10, 14, 18, 21 , 
167, 17t, 172, 174, 175, 176, 178 and 179 have been described. Those having higher binding affinities are 

1 0 desaibed Tables Vll, XIH and XIV. The amino acid nomerK^ture for describing the variants is shown below. 

Growth hormone variants may be administered and fomiuiated in the same manner as regular grow^ honnone. The 
growth hormone variants of the present invention may be expressed in any recombinant system which is capable of 
expressing native or met hGH. 

Therapeutic formulations of hGH for therapeutic administration are prepared for storage by mixing 

15 hGH having the desired degree of purity with optional phy&oiogically acceptable can-iers, excipients, or 

stabilizers (Reminoton's Pharmaceutical Sciences. 16th edition, Osol, A., Ed., (1980)., in U>e form of lyophilized 
cake or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the dosages 
and concentrations employed, and include buffers such as phosphate, citrate, and other organic acids; antioxidants 
including ascorbic acid; low molecular weight (less than about 10 residues) polypeptides; proteins, sxh as serum 

2 0 albumin, gelatin, or immunogtobuiins; hyckophiiic polymers such as polyvinylpyrrolidone; amino acids such as glycine, 

glutamine, asparagine, arginine, or lysine; monosaccharides, disaccharides, and other carbohydrates including 
glucose, manrtose, or dextrins; chelating agents such as EDTA; divalent metal iors such as zinc, cobalt or copper; 
sugar alcohols such as mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants 
such as Tween, Pluronics or polyethylene glycol (PEG). Formulations of the present invention may additionally 
25 contain a phamnaceutically acceptable buffer, amino acid, bulking agent and/or non-ionic surfactant. These include, 
for example, buffers, chelating agents, antioxidants, preservatives, cosolvents, and the like; specific examples of 
these could include, trimethyiamaine salts ("Tris buffer), and disodium edetate. The phagemids of the present 
invention may be used to produce quantities of the hGH variants free of the phage protein. To express hGH 
variants free of the gene III portion of the fusion, pS0643 and derivatives can simply be grown in a non- 

3 0 suppressor strain such as 1 6C9. In this case, the amber codon (TAG) leads to termination of translation, which 

yields free hormone, without the need for an independent DNA constnxt'on. The hGH variant is secreted from the 
host and may be isolated from the culture medium. 

One or more of the eight hGH amino adds FIG, M14, H18. H21, R167, D171, T175 and 1179 may be reped 
by any amino add other than the one found in that position in naturally occuning hGH as indicated. Therefore, 1 , 2, 
35 3, 4, 5. 6, 7, or all 8 of the indicated amino acids, FID, M14, H18, H21 , R167, D171. T175 and 1179, may be replaced 
by any of the other 19 amino acids out of the 20 amino acids listed below. In a prefen-ed embodiment, all eight 
listed amino acids are replaced by another amino acid. The most preferred eight amino acids to be substituted are 
indicated in Table XIV in Example Xll. 
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Antio acid nomecxialure. 

Ala (A) 
Arg (R) 

5 Asn (N) 

Asp (D) 
Cys (C) 
Gin (Q) 
Giu (E) 

1 0 Gly (G) 

His (H) 
He (I) 
Leu (L) 
Lys (K) 

1 5 Met (M) 

Phe (F) 
Pro (P) 
Ser (S) 
Thr (T) 

20 Trp (W) 

Tyr (Y) 
Vat (V) 

The one letter hGH variant nomenclature first gives the hGH amino acid deleted, for example gtutamate 179; then 
the amino acid inserted; for example, serine; resulting in {E1795S). 

25 

EXAMPLES 

Without further description, it is believed that one of ordinary skill in the art can, using the preceding 
description and illustrative examples, make and utilize the present invention to the fullest extent. The following 
working examples therefore specifically point out preferred embodiments of the present invention, and are not to 
30 be constnjed as limiting in any way of the remainder of the disclosure. 

EXAMPLE I 

Piasmfd Constnxrtions and Preparation of hGH-phagemld Particles 

The piasmid phGH-MISglll (Fig. 1), was constnxted from M13K07^ and the hGH producing plasmid, 
pB0473 {Cunningham, B. C, et al. , Sciepc^ . 243:1330*1336, [1989]). A synthetfc oligonucleotide 5'-AGC- 
35 TGT-GGC>TTC- GGG-CCC- TTA-GCA-TTT-AAT-GCG>GTA-3' was used to introduce a unique Apal 

restriction site (underlined) into pB0473 after the final Phe191 codon of hGH. The oligonucleotide 5'-TTC- 
ACA-AAC-GAA -GGG-CCC-C TA-ATT-AAA-GCC-AGA-3' was used to introduce a unique Apal restriction site 
(underiined), and a Giu197-tO'amber stop codon (bold lettering) into M13K07 gene III. The oligonucleotide 5'- 
CAA-TAA-TAA'GGG- GCT-AGC -CAA-AAG-AAC'TGG'3' introduces a unique Nhe\ site (underlined) after the 
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3' end of the gene III coding sequence. The resulting 650 base pair (bp) Apai-Nhel tragmem from the doubly 
mutated M13K07 gene til was doned into the large Apa\'f^e\ fragment of pB0473 to create the plasmid, 
pSOl32. This fuses the carboxyf terminus of hGH (Phel91) to the Prol98 residue of the gene III protein with the 
insertion of a glycine residue encoded from the Apa\ site and places the fusion protein under control of the E. coli 
5 alkaline phosphatase (phoA) promoter and stil secretion signal sequence (Chang, C. N., etai , Gaoa, 55:189-196, 
[19871). For indudbie expression of the fusion protein in rich media, we replaced the phoA promoter with the iac 
promoter and operator. A 138 bp EccH\-Xba\ fragment containing the lac promoter, operator, and Cap binding 
site was produced by PCR of plasmid pUC1 19 using the oligonucleotides 5'- 

CAGGAGA£MII£CCGAGTGGAAA-3' and 5'-CTGTT ICIAfiAGTOAAAnGHA-S' that flank the desired 

1 0 lac sequences and Introduce ihe EcoRI and Xbal restriction sites (underlined). This lac fragment was gel purified 
and tigated ^to the large EcoRI-Xbai fragment of pS0132 to create the plasmid, phGH-MISgliL The sequences 
of all taitored DNA junctions were verified by the dideoxy sequence method (Sanger, F., et a/. Proc. Natl. A^^ad. 
SsiLLLSA 74:5463-5467, (1977)). The R64A variant hGH phagemid was constnjcted as follows: the Nsil-BglH 
mutated fragment of hGH (Cunninghamef a/, supra ) encoding the Arg64 to Ala substitution (R64A) 

1 5 (Cunningham, B. C, Wells, J. A., Science. 244:1081-1085, [1989]) was ck>ned between the con-esponding 

restrk;tion sites in the phGH-M13glll plasmid (Fig. 1) to replace the wild-type hGH sequence. The R64A hGH 
phagemki particles were propagated and titered as described betow for the wild-type hGH-phagemid. 

Piasmids were transformed into a male strain of £ co// (JM1 01 ) and selected on carbenicillin plates. A 
single transformant was grown in 2 ml 2YT medium for 4 h at 37*C and infected with 50 of M13K07 helper 

20 phage. The infected culture was diluted into 30 ml 2YT, grown overnight, and phagemid particles were harvested 
by precipitation with polyethylene glycol (Vienra, J., Messing, J. .Methods in En?vmoiogv . 153:3-1 1 , 11987]). 
Typical phagemid particle titers ranged from 2 to 5 x 10^^ cfu/ml. The particles were purified to homogeneity by 
CsCI density centrifugation (Day, L.A. J. Mo!. Biol.. 39:265-277, [1969]) to remove any fusion protein not 
attached to virions. 

25 EXAMPLE 11 

Rabbit polyclonal antibodies to hGH were purified with protein A, and coated onto miaotiter plates 
(Nunc) at a concentration of 2 ^g/ml in 50 mM sodium carbonate buffer (pH 10) at 4'C for 16-20 hours. After 
waging in PBS containing 0.05% Tween 20. hGH or hGH-phagemid particles were serially diluted from 2.0 - 
3 0 0.002 nM in buffer A (50 mM Tris (pH 7£), 50 mM NaCl, 2 mM EDTA, 5 mg/mi bovine senjm albumin, and 0.05% 
Tween 20). After 2 hours at room temperature (rt), the plates were washed well and the indicated Mab 
(Cunninghame/ a/, supra ) was added at 1 ^ig/mi in buffer A for 2 hours at rt Following washing, horseradish 
peroxkiase conjugated goat anti-mouse JgG antibody was bound at rt for 1 hour. After a final wash, the 
peroxidase activity was assayed with the substrate, o-phenytenediamine. 
35 EXAMPLE IK 

Coupling of the hGH Binding Protein to PoJyaoylamkJe Beads and Binding Enrichments 
Oxirane polyacrylamide beads (Sigma) were conjugated to the purified extracellular domain of the hGH 
receptor (hGHbp) (Fuh, 6., etal., J. Biol. Chem.. 265:31 1 1-31 15 [1990]) containing an extra cysteine residue 
Introduced by site-directed mutagenesis at position 237 that does not affect binding of hGH (J. Wells, 
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unpubitsheci). The hGHbp was conjugated as recommended by the si4)pt«r to a ievel of 17 pmol hGHbp/mg dry 
oxirane bead, as measured by binding of p25|j ^qh |q ^^^^ Subsequently, any unreacted oxirane groups were 
blocked with BSA and Tris. As a control for non-specific binding of phagemid partictes, BSA was similarly 
coupled to the beads. Buffer for adsorption and washing contained 10 mM Tris-HC! (pH 7.5), 1 mM EDTA, 50 mM 

.5 NaCJ, 1 mg/ml BSA, and 0.02% Tween 20. EkJtion buffers contained wash buffer plus 200 nM hGH or 0.2 M 
glycine {pH 2.1). Parental phage M13K07 was mixed with hGH phagemid particies at a ratio of nearty 3000:1 
(original mixture) and tumbled for 8-12 h with a 5 ^li aSquot (0.2 mg of acrylamide beads) of eittier absorbent in a 
50 ^1 volume at room tempera^re. The beads were peHeted by centiifugation and the supemate carefully 
removed. The beads were resuspended in 200 pJ wash buffer and tumbled at room temperature for 4 hours 

1 0 (wash 1 ). After a second wash (wash 2), the beads were eluted twice with 200 nM hGH tor 6-10 hours each 
(eiuate 1 , eluate 2). The final elution was with a glycine buffer (pH 2. 1 ) for 4 hours to remove remaining hGH 
phagemid partides {eiuate 3). Each fraction was diluted appropriately in 2YT media, mixed with fresh JM101, 
incubated at 37'C for 5 minutes, and plated with 3 ml of 2YT soft agar on LB or LB carbenlcillin plates. 

EXAMPLE rV 

1 5 Construction of hGH-phagemtd Partides with a Mixture of Gene 111 Products 

The gene III protein is composed of 410 residues divided into two domains that are separated by a 
flexible linker sequence (Armstrong, J., et FEBS Lett.. 135:167-172, [1981]). The amino-terminal domain is 
required for attachment to the pili of £ colt, while the carboxyt-terminal domain is imbedded in the phage coat and 
required for proper phage assembly (Crissman, j. W., Smith, G. P., Virology. 132:445-455, [19S4]). The signal 

2 0 sequence and amino-terminal domain of gene ill was replaced with the stil sigial and entire hGH gene (Chang ef a/. 
supra) by fusion to residue 198 in the cartxjxyl -terminal domain of gene 111 (Fig. 1). The hGH-gene ill fusion was 
placed under control of the lac promoter/operator in a ptasmid {phGH-M13glil; Rg. 1) containing the pBR322 
p-iactamase gene and Col El replication origin, and the phage f1 intergenic region. The vector can be easily 
maintained as a small ii^asmid vector by selection on carbenidllin, which avoids relying on a functional gene III fusion 

25 for propagation. Alternatively, the plasmid can be efficiently packaged into virions (called phagemid particles) by 
infectton with helper phage such as M13K07 (Viera et aL supra ) which avoids problems of phage assembly. 
Phagemid infectivtty titers based upon transduction to carbenicillin resistance in mis system varied from 2-5 x 
10^ ^cok>ny forming units (cfu)/mL The titer of the M13K07 helper phage in these phagemid stocks is -10^^ 
plaque forming units (pfu)/ml. 

30 With this system we confirmed previous studies (Parmley, Smith supra) that homogeneous expression of 

large proteins fused to gene 111 is deleterious to phage production (data not shown). For example, induction of the 
lac promoter in phGH-MISgill by addition of iPTG produced tow phagemW titers. Moreover, phagemid particles 
;xoduced by co4nfection with M13K07 containing an amber mutation in gene III gave very low phagemid titers 
(<10^0 cfu/ml). We believed that multiple copies of the gene 111 fusion attached to the phagemid surface could 

35 lead to multiple point attachment (the "chelate effecr) of the fusion phage to the immobilized target protein. 
Therefore to control the fusion protein copy number we limited transcription of the hGH-gene 111 fusion by 
cuituring the plasmid in £ coif JM101 (lacl^ which contains a constitulively high level of the lac repressor protein. 
The E CO// JM101 cultures containing phGH-M13gUI were best propagated and infected with M13K07 in the 
absence of the lac operon inducer (IPTG); however, this system is flexible so that co-expression of other gene 111 
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fusion proteins can be balanced. We estimate that about 10% of the phagemid partides contain one copy of the 
hGH gene Hi fusion protein from the ratio of the amount of hGH per virion (based on hGH immuno-reactive materia! 
in CsCI gradient purified phagemid). Therefore, tiie titer of fusion phage cfispiaying the hGH gene III fusion is 
about 2 - 5 X iO^^/ml TNs number is much greater than the titer of £ cdi (-lO^ to 10%mi) Hi the culture From 
5 which they are derived. Thus, on average every £ cofi cell produces 10-100 copies of phage decorated with an 
hGH gene \\i fusion protein. 

EXAMPLE V 
Structural Integrfty of the hGH-gm II Fusion 

immunobiot analysis (Rg. 2) of the hGH-gene III phagemid show that hGH cross-reactive materia! 

1 0 comigrates with phagemid particles in agarose gels. This indicates that the hGH is tightly associated with 

phagemid partides. The hGH-gene III fusion protein from the phagemid parties runs as a single immuno-stained 
band showing that there is tittle degradation of the hGH when it is attached to gene ill Wild-type gene III protein 
is clearly present because about 25% of the phagemid particles are infectious. This is comparable to specific 
infecttvity estimates made for wild-type M13 phage that are similarly purified (by CsCl density gradients) and 

1 5 concentrations estimated by UV absorbance {Smith, G. P. supra and Parmley, Smith supra) Thus, both wild-type 
gene IK and the hGH-gene HI fusion proteins are displayed in the phage pool. 

It was important to confirm that the ternary stax;ture of the displayed hGH was maintained in order to 
have confidence that results from binding selections will translate to the native protein. We used monoclonal 
antibodies (Mabs) to hGH to evaluate the structural integrity of the displayed hGH gene III fusion protein (Tabie 

20 I). 

TABLE L Binding Of Bght Different MofXXJtonal 
Antibodies (Mab's) to hGH and hGH Phagemid Particles* 
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Mab 


iCso (nM) 




hGH 


hGH-phagemid 


1 


0.4 


0.4 


2 


0.04 


0.04 


3 


0.2 


0.2 


4 


0.1 


0.1 


S 


0.2 


>2.0 


6 


0.07 


0.2 


7 


0.1 


0.1 


8 


0.1 


0.1 



•Values given represent the concentration (nM) of hGH or hGH-phagemid partides to give half-maximal binding to 
the particular Mab. Standard en^ors in these measurements are typically at or below ±30% of the reported value. 
See Materials and Methods for furtiier details. 

30 The epitopes on hGH for these Mabs have been mapped (Cunningham et aL supra) and binding for 7 of 

8 Mabs requires that hGH be property folded. The IC50 values for all Mabs were equivalent to wild-type hGH 
except for Mab 5 and 6 . Both Mabs 5 and 6 are knovm to have binding determinants near the carboxyl-terminus of 
hGH which is blocked in the gene III fusion protein. The relative IC50 value for Mab1 which reacts with both native 
and denatured hGH is unchanged compared to the confomfiatfonaily sensitive Mabs 2-5, 7 and 8. Thus, Mabi 

35 serves as a good internal control for any errors in matching the concentration of the hGH standard to that of the 
hGH-gene III fusion. 



25 



EXAMPLE VI 
Birxfing Enrk^hmerns on Recepta AfM 

Previous workers (Parmiey, Smith supra ; Scott, Smith supra; Cwiria etaJ. supra; and Devlin etal. 

5 supra) have fractionated phage by panning with streptavidin coated polystyrene petri dishes or miaotiter plates. 
However, chromatographic systems would allow more efficient fractionation of phagemid particles displaying 
mutant proteins with different binding affinities. We chose non-porous oxirane beads (Sigma) to avoid trapping 
of phagemid particies in the chromatographic resin. Furthermore, these beads have a small particle size (1 ^xm) to 
maxirrize the surface area to mass ra^o. The extracellular domain of the hGH receptor (hGHbp) (Fuh et al. , 

1 0 supra) containing a free cysteine residue was efficiently coupled to these beads and phagemid particles showed 
very low non-specific binding to beads coupled only to bovine serum albumin (Table il), 

TABLE II. 

15 Specific Binding of Hormone Phage to hGHbp-coated 

Beads Provides an Enrichment for hGH-phage over M13K07 Phage* 



Sample 


Absorbent^ 


Total pfu 


Total cfu 


Ratio (cfu/pfu) 


Enrichnt>ent§ 


Original mtxturet 


83 X 10'''* 


2.9 X 10^ 


3.5 X 10*"* 


(1) 


Supernatant 


BSA 


7.4 X 10"*'' 


2,8 X 10^ 


3.8 X 10-4 


1.1 




hGHbp 


7.6 X 10"* 


3.3 X 10^ 


4.3 X 10-4 


1.2 


Wash 1 


BSA 


1.1 X 10"**^ 


6.0 X 10^ 


5.5 X 10-4 


1.6 




hGHbp 


1.9 X 10'*^ 


1.7 X 10^ 


8.9 X 10-4 


2.5 


Wash 2 


BSA 


5.9 X 10*^ 


2.8 X 10"* 


4.7 X 10-4 


1,3 




hGHbp 


4.9 X lO'' 


2.7x 106 


5.5 X 10-2 


1.6 X 102 


Eluate 1 (hGH)BSA 


1.1 X 10^ 


1.9 X 10^ 


1.7 X 10-3 


4.9 




hGHbp 


1.2 X 10^ 


2.1 X 10^ 


1.8 


5.1 X 10^ 


Eluate 2 (hGH)BSA 


59x 10^ 


1.2 X 103 


2.0 X 10-3 


5.7 




hGHbp 


5.5 X 10^ 


1.3 X 10^ 


2.4 


6,9 X 10^ 


Eluate3(pH 2.1)BSA 


4.6 x 10^ 


2.0 X 10^ 


4.3 X 10-3 


12.3 




hGHbp 


3.8 x 10^ 


4.0 X 10^ 


10.5 


3.0x 104 



*The titers of M13K07 and hGH-phagemid particles in each fraction was determined 
35 by multiplying the number of plaque forming units (pfu) or carbeniclllin resistant 
colony forming units (cfu) by the dilution factor, respectively* See Example IV for 
details. 

tThe ratio of M13K07 to hGH-phagemid particles was adjusted to 3000:1 In the 
original mixture. 
40 ^Absorbents were conjugated with BSA or hGHbp. 

§Enrichments are calculated by dividing the cfu/pfu ratio after each step by cfu/pfu 
ratio in the original mixture. 

In a typical enrichment experiment (Table II), one part of hGH phagemid was mixed with >3,000 parts 

M13K07 phage. After one cycle of binding and elution, 10^ phage were recovered and the ratio of i:^gemid to 

4 5 M13K07 phage was 2 to 1 . Thus, a single binding selection step gave >5000-fold enrichment. Additional elutions 

with free hGH or acid treatment to remove remaining phagemids produced even greater enrichments. The 

enrichments are comparable to those obtained by Smith and coworkers using batch elation from coated 

polystyrene plates (Smith, G.P. supra and Parmeiy» Smith stpra ) however much smaller volumes are used on the 
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beads (200 (il vs. 6 ml). There was almost no enrichment for the hGH phagemid over M13K07 when we used 
beads linked onfy to BSA. The slight enrichment observed for control beads {-10-fold for pH 2,1 elution; Table 
2) may result from trace contaminants of bovine growth hormone binding protein present in the BSA linked to the 
bead. Nevertheless these data show the enrichments for the hGH phage depend upon the presence of the hGHbp 

6 on the bead suggesting binding occurs by specific interactbn between hGH and the hGHbp, 

We evaluated the enrichment for wiW-type hGH over a weaker binding variant of the hGH on fusion 
phagemids to further demonstrate enrichment specificity, and to link the reduction in binding affinity for the 
purified hormones to enrichment factors after panning fusion phagemkfs. A fusion phagemid was constructed 
wth an hGH mutant in which Arg64 was substituted with Ala (R64A). The R64A variant hormone is about 20- 

1 0 foW reduced in receptor binding affinity compared to hGH (Kd values of 7.1 nM and 0,34 nM, respectively 

[Cunningham, Weils, supra]). The titers of the R64A hGH-gene III fuscn phagemkJ were comparable to those ol 
wiid-type hGH phagemid. After one round of binding and elution (Tabte Hi) the wild-type hGH phagemid was 
enriched from a mixture of the two phagemids plus M13K07 by 8-fokj relative to the phagemid R64A, and -10^ 
relative to M13K07 helper phage. 

15 

TABLE hGHbp^oated Beads Select for hGH F^emids 
Over 8 Weaker Binding hGH Variant Phagemkf 

Control t>eads hGHbp beads 

20 

Sample WT phagennid enrichnr>ent WT phagemid enrichnient 

total phagemid for WT/R64A total phagemid lor WT/R64A 



Origir^al mixture 8/20 (1) 8/20 (1) 

25 Supernatant ND - 4/10 1.0 

Elution 1 (hGH) 7/20 0.8 17/20 8,5* 

Eiution2(pH2.1) 11/20 1.8 21/27 5.2 



*The parent M13K07 phage, wiid-type hGH phagemid and R64A phagemid particles were mixed at a ratio of 
30 10^:0.4X).6. Binding selections were carried out using beads linked with BSA (control beads) or with the hGHbp 
{hGHbp beads) as described in Tabie ii and the Materials and Methods After each step, plasmid DNA was 
isolated{arnboim, H. C, Doly, J. , Nucleic Acids Res. . 7:1513-1523, [1979]) from carbenicillin resistant colonies 
and analyzed by restriction analysis to determine if it contained the wiid-type hGH or the R64A hGH gene IN 
fusion, 

35 tihe enrichment for wild-type hGH phagemid over R64A mutant was calculated from the ratio of hGH phagemid 
present after each step to that present in the original mixture (8/20), divided by the corresponding ratio for 
R64A phagemids. WT » wiid-type; ND » not determined. 

♦The enrichment for phagemid over total M13K07 parental phage was -10^ after this step. 

Condusions 

40 By displaying a mixture of wild-type gene lit and the gene 111 fusion proton on phagemid particles one can 

assemble and propagate virions that display a large and proper folded proton as a fusion to gene 111. The copy 
number of the gene 111 fusion protein can be effectively controlled to avoid 'chelate effects" yet maintained at high 
enou^ levels in the phagemid pool to permit panning of large epitope libraries (>10"^^). We have shown that hGH 
(a 22 kD protein) can be displayed in its native folded form. Binding selections performed on receptor affinity 

45 beads eiuted with free hGH, efficiently enriched for wiid-type hGH phagemids over a mutant hGH phagemid shown 
to have reduced receptor binding affinity. Thus, it is possible to sort phagemid partk^les whose binding constants 
are down in the nanomolar range. 
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Protein-protein arxi antibody-antigen interactions are ciominatecf by discontinuous epitopes (Janin, J., 

etai , J. Moi. Btol.. 204:155-164, [1988]; Argos, P., Prot. Eng. . 2:10M13, [1988]; Bartow, DJ.^etaL , 

322:747*748, [1987]; and Davies, D.R., etai , J. BioL Chem.. 2 63:10541-10544. [1988]); that is the residues 

directly involved in binding are ciose in tertiary structure but separated by residues not involved in binding. The 

5 screening system presented here should altow one to analyze more conveniently protein-receptor interactions and 

isolate discontinuous epitopes in proteins with new and high aSirvty binding properties. 

EXAMPLE VK 
Seiectk)n Of hGK Mutants from a Ubrary Rarxk^^ 
hGH Codons 172, 174, 176, 178 

10 Cgnstructign of template 

A mutant of the hGH-gene 111 fusion protein was constructed using the method of Kunkei.,ef ai Meth. 
Enzvmo!. 1S4, 367-382 [1987]. Template DNA was prepared by growing the piasmid pS0132 (containing the 
natural hGH gene fused to the carboxy-terminal half of M13 gene 111, under control of tf>e alkaline phosphatase 
promoter) in CJ236 cells with M13-K07 phage added as helper. Single-stranded, uracil-contatning DNA was 

1 5 prepared for mutagenesis to introduce (1) a mutation in hGH which would greatly reduce binding to the hGH 
binding protein (hGHbp); and (2) a unique restriction site (Kpn!) which could be used for assaying for - and 
selecting against - parental background phage. Oiigonucleotide-directed mutagenesis was carried out using T7 
DNA polymerase and the following c^igodeoxy-nudeotide: 

Gly Thr 

20 hGH codon: 178 179 

5*-G ACA TTC CTG QGZ AQP GTG CAG T-3* 

< Kpnl > 

This oligo introduces the Kpn! site as shown, along with mutations (R178G, I179T) in hGH. These mutations are 

predicted to reduce binding of hGH to hGHbp by more than 30-fold- Clones from the mutagenesis were screened 
25 by Kpnl digestion and confirmed by dideoxy DNA sequencing. The resulting construct, to be used as a template 

for random mutagenesis, was designated pH0415. 

Random mutagenesis within h6iix4 of hGH 

Codons 172, 174, 176, 178 were teirgeted for random mutagenesis in hGH, again using the method of 

Kunkel. Single-stranded template from pHD415 was prepared as above and mutagene^s viras carried out using 
30 the following pool of oligos: 

hGH codon: 172 174 

5'- GC TTC AGG AAG GAC ATG GAC GTC MNS ACA- 

35 He 
176 178 179 

- UHS. CTG HHS. AXC GTG CAG TGC CGC TCT GTG G-3' 

As shown, this oligo pool reverts codon 179 to wild-type (lie), destroys the unique Kpnl site of pH04l5, and 
introduces random codons (NNS, where N- A,G,C, or T and S= G or C) at positions 1 72,1 74,1 76, and 1 78. Using 

4 0 this codon selection in the context of the above sequence, no additional Kpnl sites can be created. The choice of the 
NNS degenerate sequence yields 32 possible codons (inciuding one "stop" codon, and at least one codon for each 
amino acid) at 4 sites, for a total of (32)^= 1,048,576 possible nucleotide sequences (12% of which contain at 
least one stop codon), or (20)'^= 160,000 possible polypeptide sequences plus 34,481 prematurely terminated 
sequences (i.e. sequences containing at least one stc^ codon). 

45 PropaQation of the initial library 
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The mutagenesis products were extracted twice with phenolxihbroform (50:50) and ethanoi 
precipftated with an excess of carrier tRNA to avoid adding salt that woUd confound the subsequent 
electroporation step. Approximately 50 ng (15 fmols) of DNA was electroporated into WJM101 cells (2.8 x 10"*^ 
cells/ml) in 45 total volume in a 0,2 cm cuvette at a voltage setting of 2.49 kV with a single pulse (time 
5 constant « 47 msec.). 

The cells were a»owed to recover 1 hour at 37^C with shaking, then mixed with 25 ml 2YT medium, 1 00 
M^g/mL carbenidllin, arxl M13*K07 (multiplicity of infection = 1000). Plating of serial dilutions from this culture 
onto carbenidllin-containing media Bidicated that 8.2 x 10^ etectrotransformants were obtained. After 10' at 
23^C, the culture was incubated overnight (15 hours) at ST^C with shaking. 

1 0 After overnight incubation, the ceils were pelleted, and doubie-stranded DNA (dsONA), designated 

pLIBI , was prepared by tie alkaline lysis method. The supernatant was spun again to remove any remaining cells, 
and the phage, designated phage pool 4)1 , were PEG-predpitated and resuspended in 1 mL STE buffer (1 0 mM 
Tris, pH 7.6, 1 mM EOT A, 50 mM NaCI). Phage titers were measured as colony-forming units (CPU) for the 
recombinant phagemid containing hGH-g3p gene III fusion (hGH-g^) plasmid, and piaque-forming units (PFU) for 

1 5 the M13-K07 helper phage. 

Bincfino setection using tmmobiltzed hGHbp 

1. BINDING: An aliquot of phage pool <t>1 (6 x lO^ CPU, 6 x 10^ PFU) was diluted 4.5-foid in buffer A 
(Phosphate-buffered saline, 0.5% BSA, 0.05% Tween-20, 0.01% thimerosal) and mixed with a 5 iiL suspension of 
oxirane-polyacryiamide beads coupled to the hGHbp containing a Ser237 Cys mutation (350 fmols) in a 1 .5 mL 

2 0 sitated polypropylene tube. As a control, an equivalent aliquot of phage vi^re mixed in a separate tube with beads 

that had been coated with BSA only. The ^ge were altowed to bind to the beads by incubating 3 hours at room 
temperature {23<^C) with slow rotation (approximately 7 RPM). Subsequent steps were carried out with a 
constant volume of 200p.L and at room temperature. 

2. WASH: The beads were spun 15 sec., and the s^pematant was removed (Sup. 1). To remove 

25 phage/phagemid not spedfically bound, the beads were washed twice by resuspending in buffer A, then pelleting. 
A final wash consisted of rotating the beads in buffer A for 2 hours. 

3. hGH ELLiTION: Phage/phagemid binding weakly lo the beads were removed by stepwise eiulion with 
hGH. in the first step, the beads were rotated with buffer A containing 2 nM hGH. After 17 hours , the beads 
were pelleted and resuspended in buffer A containing 20 nM hGH and rofeted for 3 hours, then pelleted. In the 

3 0 final hGH wash, the beads were suspended in buffer A containing 200 nM hGH and rotated for 3 hours then 

pelleted. 

4. GLYCINE ELUTION: To remove the tightest-binding phagemid (i.e. those still bound after the hGH 
washes), beads were suspended in Gtydne buffer (1 ^Glydne, pH 2.0 with HQ), rotated 2 hours and pelleted. 
The supeniatant (fraction "G*; 200kL) was neutralized by adding 30 jxL of 1 M Tris base. 

35 Fraction G eiuted from the hGHbp-beads (1 x 10^ CPU, 5 x 10^ PFU) was not substantially enriched 

for phagemid over K07 helper phage. We believe this resulted from the fact that K07 phage packaged during 
propagatton of the recombinant phagemid display the hGH-g3p fusion. 



However, when compared with fraction G eluted from the BSA-coated control beads, the hGHbp-beads 
yielded 14 times as many CPU's. This reflects the enric^vnent of tight-binding hGH-disptaying phagemid over 
nonspedficaliy-binding phagemid. 

5. PROPAGATION: An aliquot (4.3 x CPU) of fraction G eluted from the hGHbp-beads was used 
5 to infect log-phase WJM101 ceils. Transductions were carried out 1^ mixing 100 nL fraction G with 1 mL WJMIOI 
cells, incubating 20 min. at 37^C, then adding K07 (muitipiictty of infection* 1000). Cultures (25 mL 2YT plus 
carttenidllin) were grown as described above and ^ second pool of phage (Library 1G, for first glycine elution) 
were prepared as described above. 

Phage from library 1G (Rg. 3) were selected for binding to hGHbp beads as described above. Fraction 
10 G eluted from hGHbp beads contained 30 times as many CPU's as fraction G eluted from BSA-beads in this 
selection. Again, an aliquot of fraction G was propagated in WJMIOI cells to yield library 1G2 (indicating that 
this library had been twice selected by glycine elution). DouWe-stranded DNA (pLIB 1G^) was also prepared 
from this culture. 

Kpnl assay and resWctfen-setectlon of fePNA 

15 To reduce the level of background {Kpnl+) template, an aliquot (about 0.5 ^g) of pLIB IG^ was 

digested with Kpnl and electroporated into WJM101 cells. These cells were grown in the presence of K07 
(multiplicity of infections 100) as desaibed for the initial library, and a new phage pool, pLIB 3, was prepared 
(Rg. 3). 

In addition, an aliquot (about 0.5 ^ig) of dsDNA from the initial library (pLlB1) was digested with Kpnl 
20 and electroporated directly into WJMIOI cells. Transformanls were allowed to recover as above, infected with 
M13-K07, and grown overnight to obtain a new library of phage, designated phage Library 2 (Fjg. 3). 
Suw$s sh^rwx te ofs g*Ktign 

Phagemid binding, elution, and propagation were carried out in successive rounds for phagemid derived 
from both pLIB 2 and pUB 3 (Rg. 3) as desaibed above, except that (1) an excess (lO-fold over CPU) of 
25 purified K07 phage (not displaying hGH) was added in the bead-binding cocktail, and (2) the hGH stepwise 
eiutions were replaced with brief washings of buffer A alone. Also, in some cases, XL1 -Blue cells were used for 
phagemid propagation. 

An additional digestion of dsDNA with Kpnl was carried out on pLlB 2G^ and on pLIB 3G^ before the 
final round of bead-binding selection (Rg. 3). 
30 DNA Sgquendhg of s^fected phggfflltfs 

Four independently isolated ctones from LIB 4G^ and four independently isolated ctones from LIB 5G^ 
were sequenced by dideoxy sequencing. All eight of these ctones had kJentical DNA sequences: 

hGH codon: 172 174 176 178 

35 5' -AAG GTC TCC ACA TAG CTG AGG ATC-3' 

Thus, all these encode the same mutant of hGH: (E174S, P176Y). Residue 172 in these clones is Lys as in wild- 
type. The codon selected for 172 is also kJentical to wild-type hGH. This is not surprising since AAG is the only 
lysine-codon possible from a degenerate "NNS" codon set ReskJue 178-Arg is also the same as wild-type, but 
here, the codon selected from the library was AAG instead of CGC as is found in wild-type hGH, even though the 
4 0 latter codon is also possible using the "NNS* codon set. 
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Moltipllcltv Of K Q7 In fection 

The multipficity of infection of K07 infection is an importam parameter in the propagation of 
recombinant phagemids. The K07 multiplidty of infection must be high enough to insure that virtually all cells 
transformed or transfecled with phagemid are able to package new phagemid partides. Furthermore, the 
5 concentration of wiW-type gene HI in each cell should be kept high to reduce the possibility of multiple hGH-gene Hi 
fusion molecules being displayed on each phagemid particte, thereby reducing chelate effects in binding. However, 
if the K07 multiplicity of infedion is too high, the packaging of K07 will compete with that of recombinant 
phagemid. We find that acceptable phagemid yieWs, with only 1-10% background K07 phage, are obtained when 
the K07 multiplicity of infection is 100. 

10 

TabtefV. 

Phage Pool moi (K07) Enrichment hGHbp/BSA beads Fraction Kpnl 

CFU/PFU 



L1B1 1000 ND 14 0.44 

L1B1G 1000 ND 30 0.57 

LIB 3 100 ND 1.7 0.26 

LIB3G3 10 ND 8.5 0.18 

UBSG"^ 100 460 220 0.13 

UBS 100 ND 15 ND 

LIB 2 100 ND 1.7 <0.05 

LIB2G 10 ND 4.1 <0.10 

LIB2G2 100 1000 27 0.18 

LIB 4 100 170 38 ND 



Phage pools are labelled as shown (Fig. 3). The multiplicity of infection (moi) refers to the multiplicity of K07 
infection (PFU/cells) in the propagation of phagemid. The enrichment of CFU over PFU is shown in those cases 

1 5 where purified K07 was added in the binding step. The ratio of CFU eiuting from hGHbp-beads over CFU eiuting 
from BSA-beads is shown. The fraction of KpnI-containing template (i.e., pH0415) remaining in the pool was 
determined by digesting dsDN A with Kpnl plus EcoR I, running the products on a 1 % agarose gel . and laser- 
scanning a negative of the ethidium bromide-stained DNA. 
Receptor^bindlng affinity of the hormone hGHfE174& F176Y) 

20 The fact that a single done was isolated from two different pathways of selection {Bg, 3) suggested 

that the double mutant (E174S.F176Y) binds strongly to hGHbp. To determine the affinity of this mutant of 
hGH for hGHbp, we constructed this mutant of hGH by site-directed mutagenesis, using a plasmid (pB0720) 
which contains the wild-type hGH gene as template and the foltowing oligonudeotide which changes codons 1 74 and 
176: 

25 hGH codon: 172 174 17$ 178 

Lys Ser Tyr Arg 

5'- ATG GAC AAG GTS ICG ACA T^C CTG CGC ATC GTG -3' 

The resulting constnjct, pH0458B, was transformed into £ coli strain 16C9 for expression of the mutant 
3 0 hormone. Scatchard analysis of competitive binding of hGH(El74S»F176Y) versus ^ ^^l-^GH to hGHbp indicated 
that me (E174S,F176Y) mutant has a tending affinity at least 5.0-foId tighter than that of wild-type hGH. 
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EXAMPLE VIII 

SELECTION OF hGH VARIAKTS FROM A 
5 HEUX-4 R/^DOM CASSEHE UBRARY OF HORMONE-PHAGE 

Human growth hormorve variants were produced by the method of the present invention using the 

phagemid described in figure 9. 

Constructk)nofad^tisabtehomx)r>e^>h^ 

We designed a vector for cassette mutagenesis (Wetls et al„ Gene 34. 315-323 [1985]) and expression 

10 of the hGH-gene 111 fusion protein with the objectives of (1) improving the linkage between hGH and the gene 111 
moiety to more favorably display the hGH moiety on the phage (2) limiting expression of the fusion protein to 
obtain essentially 'monovalent display (3) allowing for restriction nuclease selection against the starting vector, 
(4) eliminating expression of fusion protein from the starting vector, and (5) achieving facile expres^on of the 
corresponding tree hormone from a given hGH-gene III fusion mutant 

1 5 Ptasmid pS0643 was constructed by oligonucleotide-dtrected mutagenesis (Kunkei et a!., Methods 

Enzymol. 154, 367-382 [1907]) of pS0132. which contains pBR322 and f1 origins of replication and expresses an 
hGH-gene )l) fusion protein (hGH residues 1-191, followed by a single G!y readue, fused to Pro-198 of gene IH) 
under the control of the LJ^liM promoter (Bass et al,, Proteins 8, 309-314 l1990])(Rgure 9). Mutagenesis 
was carried out with the oligonucleotide 5'-GGC-AGC-TGT-GGC-T TC-TAG-A GT-GGC-GGC-GGC-TCT- 

20 GGT-3\ which introduces a XbaJ site (underlined) and an amber stop codon (TAG) following Phe-191 of hGH. In 
the resulting construct, pS0643, a portion of gene III was deleted, and two silent mutations (underlined) 
occun-ed, yielding the following junction between hGH and gene 111: 

— hGH > genem > 

25 187 188 189 190 191 am* 249 250 251 252 253 2&1 

dy Ser Cys Gty Pbe Gkx Ser Ciy Qy Oty Ser Gty 

GGC.AGC TGT GGA TIC TAG ACT CCQ COT GGC TCT GGT 

This shortens the total size of the fusion protein from 401 residues in pS0132 to 350 residues in 
3 0 pS0643. Experiments using monocional antibodies against hGH have demonstrated ttvat the hGH portion of the 
new fusion protein, assembled on a phage particle, is more accessible than wfas the previous, longer fusion. 

For propagation of hormone-displaying phage, pS0643 and derivatives can be grown in a amber- 
suppressor strain of E. colt, such as JM101 or XLI-Blue (Bullock et al., BioTechniqoes 5, 376-379 [19871). Shown 
above is substitution of Gfu at the amber codon which occurs m sudE suppressor strains. Suppression with other 
35 amino acids is also possible in various available strains of Lowell known and puWically available. 

To express hGH (or mutants) free of ttie gene Hi portion of the fusion, pS0643 and derivatives can 
simply be grown in a non-sippressor strain such as 16C9. fei this case, the amber codon (TAG) leads to 
termination of translation, which yields free hormone, without the need for an independent DNA construction. 

To create sites for cassette mutagenesis, pS0643 was mutated with the oligonucleotides (1) 5'-CGG- 
40 ACT-GGG-CAQi&IMTG-AAG-CAG-ACC-S', which destroys the unique Ealll site of pS0643; (2) 5'-CTC- 
AAG-AAC-TAC-G GG-TTA-CCC -TGA-CTG-CTT-CAG-GAA-GG-3\ which inserts a unique fislEit Site, a 
single-base frameshift, and a non-amber stop codon (TGA); and (3) 5'-CGC-ATC-GTG-CAG-TGC -AGA-TCT - 
GTG-GAG-GGC-3', which introduces a new EgUl site, to yield the starting vector, pH0509. The addition of a 
frameshift along with a TGA stop codon insures that no genelil-fusion can be produced from the starting vector. 
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The BstEH - figUl segment is cut out of pH0509 and replaced with a DNA cassette, mutated at the codons of 
interest. Other restriction sites for cassette mutagenesis at other locations in hGH have also been introduced into 
the hormone-phage vector. 
Cassette mutagenesis within helix 4 hGH 

5 Codons 172, 174, 176 and 178 of hGH were targeted for random mutagenesis because they all lie on or 

near the surface of hGH arxj contribute signifcantiy to receptor-binding (Cunningham and Weils, Science 244, 
108M085 [1989]); they all Se within a weH-defined structure, occupying 2 "turns' on the same side of heiix 4; 
and they are each substituted by at least one amino acid among known evolutionary variants of hC H. 

We chose to substitute NNS (N=A/G/CyT; S«G/C) at each of the target residues. The choice of the 
1 0 NNS degenerate sequence yields 32 possiWe codons (including at least one codon for each amino acid) at 4 sites, 
for a total of (32)^= 1,048,576 possible nucleotide sequences, or (20)^* 160.000 possible polypeptide 
sequences. Only one stop codon, amber (TAG), is allowed by Us choice of codons, and this codon is suppressit^ 
as Glu in supE strains of E. coli . 

Two degenerate oligonucleotides, with NNS at codons 172, 174, 176, and 178, were synthesized, 
1 5 phosphoryiated, and annealed to construct the mutagenic cassette: S'-GT-TAC-TCT-ACT-GCT-TTC-AGG- 
AAG-GAC-ATG-GAC-NNS-GTC-NNS*ACA-NNS-CTG-NNS-ATC-GTG-CAG-TGC'A-3', and 5'-GA-TCT- 
GCA-CTG-CAC-GAT-SNN-CAG-SNN-TGT-SNN-GAC-SNN-GTC-CAT-GTC-CTT-CCT-GAA-GCA-GTA- 
GA-3'. 

The vector v^as prepared by digesting pH0509 with BsO followed by figUl. The products were run on 
20 a 1% agarose gel and the targe fragment excised, phenol-extracted, and ethanol precipitated. This fragment was 
treated with calf intestinal phosphatase (Boehringer), then pheno!:chloroform extracted, ethanol precipitated, 
and resuspended for ligation with the mutagenic cassette. 

Prwggttton of the Initial ]mn In XlVBiw gells 

Following ligation, the reaction products were again digested with BstEif . then phenol :chloroform 
25 extracted, ethanol precipitated and resuspended in water. (A BstElt recognition site (GGTNACC) is created 
within cassettes which contain a at position 3 of codon 172 and an ACC (Thr) codon at 174. However, 
treatment with BstElt at this step should not select against any of the possible mutagenic cassettes, because 
virtually all cassettes will be heteroduplexes, which cannot be cleaved by the enzyme.) Approximately 150 ng (45 
fmols) of DNA was electroporated into XLI-Blue cells (1.8 x 10^ cells in 0.045 mL) in a 0.2 cm cuvette at a 
30 voltage setting of 2.49 kV with a single pulse (time constant = 4.7 msec.). 

The ceils were allowed to recover 1 hour at 37^ in S.O.C media with shaking, then mixed with 25 mL 
2YT medium, 100 mg/mL carbenicillin, and M13-K07 (moi= 100). After 10' at 23^0, the culture was incubated 
overnight (15 hours) at 37^0 with shaking. Plating of serial dilutions from this culture onto carbeniciitin- 
containing media indicated that 3.9 x 10^ electrotransformants were obtained. 
35 After overnight incubation, ^ cells were pelleted, and double-stranded DNA (dsDNA), designated 

pH0529E (the initial library), was prepared by the alkaline lysis method. The supernatant was spun again to 
remove any remaining cells, and the phage, designated phage pool 4>H0529E (the initial library of phage), vi«re 
PEG-precipttated and resuspended in 1 mL STE buffer (10 mM Tris. pH 7.6, 1 mM EDTA, 50 mM NaCI). Phage 
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titers were measured as cok>ny-formir>g units (CFU) for the recombir^t phagemid containing hGH-g3p. 
Approximately 4.5 x lO'^S CFU were obtained from tfie starting library, 

t?ggengracy the starBnq library 

From the pool of electrotransfonmants, 58 ctones were sequenced in the region of the BstElt-Bglll 
5 cassette. Of these, 17% corresponded to the starting vector, 17% contained at least one frame shift, and 7% 
contained a non-silent (non-terminating) mutation outside the four target codons. We conclude that 41% of tt% 
dones were defective by one of the above measures, leaving a total funcbonal pool of 2.0 x 10^ initial 
transformants. This ni^ber still exceeds the possible number of DN A sequences by nearly 20-foid. Therefore, 
we are confident of having all possible sequences represented in the starting library. 
1 0 We examined the sequences of non-selected phage to evaluate the degree of codon bias in ihB 

mutagenesis (T able V). The results indicated that, although some codons (and amino adds) are under- or over- 
represented relative to the random expectation, the library is extremely diverse, with no evidence of large-scale 
•sibling* degeneracy (Table VI). 

TabteV. 

15 

Codon distribution {per 188 codons) of non-selected hormone phage. Clones were sequenced from the starting 
library {pH0529E). All codons were tabulated, including those from clones which contained spurious mutations 
and/or frameshifts. * Note: the amber stop codon (TAG) is suppressed as Glu in XL1-Blue celts. Highlighted 
codons were over/under-represented by 50% or more. 

20 





Residue 


Numljer expected 


Number feund 






Leu 


17.6 


18 


1.0 


25 


Ser 


17.6 


26 


1.5 




Arg 


17.6 


10 


0.57 




Pro 


11.8 


16 


1.4 




Thr 


11.8 


14 


1,2 


30 


Ala 


11.B 


13 


1.1 




Giy 


11.8 


16 


1.4 




Va! 


11.8 


4 


0.3 




le 


5.9 


2 


0.3 


35 


Met 


5.9 


1 


0.2 




Tyr 


5.9 


1 


0.2 




His 


5.9 


2 


0,3 




Trp 


5.9 


2 


0.3 




Phe 


5,9 


5 


0.9 


40 


Cys 


5.9 


5 


0.9 




Gh 


5.9 


7 


1.2 




Asn 


5.9 


14 


2.4 




Lys 


5.9 


11 


19 




Asp 


5.9 


9 


1.5 


45 


Glu 


5.9 


6 


1,0 




amber* 


5.9 


6 


1.0 
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TaWe VI. 

Non-selected {pH0529E) clones with an open reading frame. 

The notation, e.g. TWGS. (lenotes the hGH mutant 172T/174W/176G/178S. Amber (TAG) codons, translated 
5 as Glu in XL1-B)ue ceils are shown as e. 



Ke NT 


KTEQ 


CVLQ 


TWGS 


HmR 


EASL 


PeER 


FPCL 


SSKE 


LPPS 


NSDF 


ALLL 


SLOP 


HRPS 


PSHP 


QQSN 


LSLe 


SYAP 


GSKT 


NGSK 


ASNG 


TPVT 


LTTE 


EANN 


RSRA 


PSGG 


KNAK 


LCGL 


LWFP 


SRGK 


TGRL 


PAGS 


GLDG 


AKAS 


GRAK 


NDPi 


GNDD 


GTNG 
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Preparation of innmobttized hGHbp and hPRlbp 

immobilized hGNbp ("hGHbp-beads") was prepared as described (Bass et aL, Proteins 8, 309-314 
[1990]). except that wild-type hGHbp (Fuh et ai.. J. Biol. Chem. 265. 3111-3115 [1990)) was used. Competitive 
25 binding experiments with 25(] ^QH indicated that 58 fmols of functional hGHbp were coupled per i± of bead 
suspension. 

Immobilized hPRLbp ("hPRLbp-beads") was prepared as above, using ihe 211 -residue extracellular 
domain of the prolactin receptor (Cunningham et ah, Science 250, 1709-1712 [1990)). Competitive binding 
experiments with [^25j] ^GH rn me presence of 50 zinc indicated that 2.1 fmols of functional hPRLbp were 

3 0 coi4^d per jiL of bead suspension. 

"Blank beads* were prepared by treating the oxirane-acrylamide beads with 0,6 M ethanolamine (pH 
9.2) for 15 hours at 4^^C. 

Binding setection uanq lnTTX)blfoed hGHbP and hPRLbp 

Binding of homf)one-phage to beads was carried out in one of the following buffers: Buffer A (PBS, 

35 0.5% BSA, 0.05% Tween 20, 0,01% thimerosal) for selections using hGHbp and blank beads; Buffer B (50 mM 
tris pH 7.5, 10 mM MgCl2. 0.5% BSA, 0.05% Tween 20, 1 00 mM ZnCIa) tor selections using hPRLbp in the 
presence of zinc {+ Zn2+); or Buffer C {PBS, 0.5% BSA, 0.05% Tween 20, 0.01% thimerosal, 10 mM EDTA) for 
selections using hPRLbp rn the absence of zinc (+ EDTA). Binding selections were can*ied out according to each 
of the following paths: (1) binding to blank beads, (2) binding to hGHbp-beads, (3) binding to hPRLbp-beads (+ 

40 Zn2+), (4) binding to hPRLbp-beads (+ EDTA), (5) pre-adsorbing twice with hGHbp beads then binding the 
non-adsorbed fraction to hPRLbp-beads 

{•-hGHbp, +hPRLbp' selection), or (6) pre-adsorbing twice with hPRLbp-beads then tending the non-adsorbed 
fraction to hGHbp-beads C-hPRLbp, +hGHbp'' selection). The latter two procedures are expected to enrich 
for mutants binding hPRLbp but not hGHbp, or for mutants binding hGHbp but not hPRLbp, respectively. 

4 5 Bindng and elution of phage was canied out tfi each cycle as follows: 

1. BINDING: An aliquot of bomnone phage (typically 10^ -10^^ CPU) was mixed with an equal amount of 
non-hormone phage (pCAT), diluted into the appropriate buffer (A, B, or C), and mixed with a 10 mL suspension 
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Of hGHbp, hPRLbp or biank beads in a total volume of 200mL in a 1 5 mL polypropyiene tube. The phage were 
allowed to bind to the beads by incubating 1 hour at room temperature {23^0) with stow rotation {approximately 
7 RPM). Subsequent steps were carried out with a constant volume of 200^il and at room temperature. 

2. WASHES: The beads were spun 15 sec,, and the supernatant was rerrK>ved. To reduce the number of 
6 phage not spedficaily bound, the beads were washed 5 times by resuspending briefiy in the appropriate buffer, 

then pelleting. 

3. hGH ELUTION: Phage binding weakly to the beads were removed by elution with hGH. The beads 
were rotated with the appropriate buffer containing 400 nM.hGH for 15-17 hours. The supernatant was saved as 
the "hGH elution* and tf>e beads. The beads were washed by resuspending briefly in buffer and pelleting. 

1 0 4. GLYCINE ELUTION: To remove the tightest-binding phage (i.e. those stilt bound after the hGH 

wash}, beads were suspended m Glycine buffer (Buffer A plus 0.2 M.Giycine, pH 2.0 with HCI), rotated 1 hour and 
pelleted. The supernatant ("Glycine elution"; 200^L) was neutralized by adding 30 mL of 1 M Tris base and 
stored at C. 

5. PROPAGATION: Aiiquots from the hGH eluttons and from the Glycine elutions from each set of 
1 5 beads under each set of conditions were used to infect separate cultures of log-phase XL1 -Biue ceils. 

Transductions were canied out by mixing f^ge with 1 mL XLI-Blue cells, incubating 20 min. at 37^0, then adding 
K07 {moi= 100). Cultures (25 mL 2YT plus carbeniciiiin) were grown as described above and the next pool of 
phage was prepared as described above. 

Phage binding, elution, and propagation were carried out in successive rounds, according to the cycle 
20 described above. For example^ the phage amplified from the hGH etulion from hGHbp-beads were again selected 
on hGHbp-beads and eluted with hGH, then used to infect a new culture of XL1 -Blue cells. Three to five rounds of 
selection and propagation were carried out for each of the selection procedures described above. 
DNA Seouencina of selected phadentids 

From the hGH and Glycine elution steps of each cycle, an aliquot of phage was used to inoculate XLi-Blue 
25 cells, which were plated on LB media containing carbeniciiiin and tetracycline to obtain independent clones from 
each phage pool Single-stranded DNA was prepared from isolated colony and sequenced in the region of the 
mutagenic cassette. The results of DNA sequendng are summarized in terms of the deduced amino acid sequences 
inRgures5,6, 7, and 8. 

30 Expres^on and assafy of hGH nutants 

To determine ti>e binding affinity of some of the selected hGH mutants for the hGHbp, we transformed 
DNA from sequenced clones into E.coli strain 16C9. As described above, this is a non-suppressor strain which 
terminates translation of protein after the final Phe-191 residue of hGH. Single-stranded DNA was used for 
these transformations, but double-stranded DNA or even whole phage can be easily etectroporated into a non- 
35 suppressor strain for expression of free hormone. 

Mutants of hGH were prepared from osmotically shocked cells by ammonium sulfate precipitation as 
described for hGH (Olson et aL, Nature 293, 408-41 1 [19811), and protein concentrations were measured by 
laser densitomoetry of Coomas^-stained SDS-polyacrylamide gel electrophoresis gels, using hGH as standard 
(Cunningham and Wells, Science 244, 108M085 [1989]). 
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The binding affinity of each mutant was determined by ciisplacement of ^25) as described (Spencer 
et ai., J. Btol. Chem. 263, 7862-7867 [19881 ; et al.. J. Biot. Chem. 265, 3111-31 15 [1990]), uang an anti- 
receptor monoclonal antibody {Mab263). 

The results for a number of hGH mutants, selected by different pathways (Rg. 6) are shown in Table 
5 VIL Many of these mutants have a tighter binding affir^ty for hGHbp than wild-type hGH. The most improved 
mutant, KSYR, has a binding affinity 5.6 times greater than that of wild-type hGH. The weakest selected 
mutant, among those assayed was only about 10-foId tower in binding affinity ^ hGH, 

Binding assays may be carried out for mutants selected for hPRLbp-binding. 

10 Table V«. 

Compemive binding to hGHbp 

The selected pool in which each mutant was found is indicated as 1 G (first glycine selection) , 3G (third 
glycine selection), 3H prd hGH selection), 3* (third selection, not binding to hPRlip, but binding to hGHbpl 
15 The number of times each mutant occurred among all sequenced dones is shown (). 
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Mutant 


Kd (nM) 


Kd(mut)/Kd(hGH) 


Pool 


KSYR (6) 


0.06 + 0.01 


0.18 


1G3G 


RSFR 


0.10+0.05 


0.30 


3G 


RAYR 


0-13 + 0.04 


0.37 


3* 


KTYK (2} 


0.16 + 0.04 


0.47 


H,3G 


RSYR (3) 


0.20 + 0.07 


0.58 


1G3H,3G 


KAYR (3) 


0.22 + 0.03 


0.56 


3G 


RFFR (2) 


0.26 + 0.05 


0.76 


3H 


KQYR 


0.33 + 0.03 


1.0 


3G 


KEFR= wt (9) 


0.34+ 0.05 


1.0 


3H,3G,3* 


RTYH 


0.58 + 0.17 


2.0 


3H 


QRYR 


0.83 + 0.14 


2.5 


3* 


KKYK 


1.1 +0.4 


3.2 


3* 


RSFS (2) 


1.1 +0.2 


3.3 


3G,* 


KSNR 


3.1 +0.4 


9.2 


3* 



Nam and non-addWvg gffwts on Mmflng 

At some residues, substitution of a particular amino add has essentially the same effect independent of 
5 0 surrounding residues. For example, substitution of F1 76Y in the background of 1 72R/1 74S reduces binding 
affinity by 2.0-fold (RSFR vs. RSYR). Similarly, in the background of 172K/174A the binding affinity of the 
F176Y mutant (KAYR) is 2.9-fold weaker than the corresponding 176F mutant (KAFR; Cunningham and Wells. 
1989). 
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On the other hand, the bincfing constants determined tor several selected mutants of hGH demonstrate 
non-additive effects of some amino acid substitutions at residues 172, 174, 176, and 178. For example, in the 
background of 172K/176Y, the substitution E174S results in a mutant (KSYR) which binds hGHbp 3J-told 
tighter than the corresponding mutant containing E174A (KAYR). However, in the background of 172R/176Y, 
5 the effects of these El 74 substitutions are reversed. Here, the E174A mutant (RAYR) binds 1.5-fold tighter 
than the E174S mutant (RSYR). 

Such non-additive effects on binding for substitutions at proximal residues illustrate me utility of 
protejn-phage binding selecton as a means of selecting optimized mutants from a library randomized at several 
positions, in the absence of detailed structural information, without such a setedion process, many combinations of 
1 0 substitutions might be tried before finding the optimum mutant. 

SELECTION OF hGH VARIANTS FROM A HEUX-1 RANDOM CASSEHE 
1 5 UBRARY OF HORMONE^PHAGE 

Using the methods described in Example Vlii, we targeted another region of hGH involved in binding to 
the hGHbp and/or hPRLbp, helix 1 residues 10. 14, 18,21. for random mutagenesis in the phGHam-g3p vector 
(also known as pS0643; see Example Vlil). 

20 We chose to use the "amber* hGH-g3 construct (called phGHam-^Sp) because it appears to make the 

target protein, hGH, more accessiWe for binding. This is supported by data from comparative ELISA assays of 
monoclonal antibody binding. Phage produced from both pS0132 (S. Bass, R. Greene, J. A. Wells, Proteins 8, 
309 (1990).) and phGHam-g3 were tested with three antibodies {Medix 2, 1B5.G2, and 5B7.C10) that are known 
to have binding detemfiinants near the carboxyl-temiinus of hGH [B. C. Cunningham, P. Jhurani, P. Ng, J. A. Wetts, 

25 So'eoce 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Sdence 244, 1081 (1989); L. Jin and J. Wells, 
unpublished results), and one antibody (Medix 1) that recognizes deienninants in helices 1 and 3 ([B. C. 
Cunningham, P. Jhurani, P. Ng, J. A. Wells, Soence 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 
244, 1081 (1989)]). Phagemid particles from phGHam-g3 reacted much more strongly vwth antibodies Medix 2, 
1B5.G2, and 5B7.C10 than did phagemid partlctes from pS0132. in particular, binding of pS0132 particles was 

30 reduced by >2000-foW for both Medix 2 and 5B7.C10 and reduced by >25-foW for 1B5.G2 compared to binding 
to Medix 1 . On the other hand, binding of phQHam-g3 phage was weaker by only about 1 .5-fotd, 1 .2-fold, and 2.3- 
foW for the Medix Z, 1B5.G2, and 5B7.C10 antibodies, respectively, compared with binding to MEDIX 1. 
Construe of the heflx l library bv cassette mutap&nesfe 

We mutated reskJues in helix 1 that were previously ktentified by alanine-scanning mutagenesis [B. C. 

3 5 Cunningham. P. Jhurani, R Ng, J. A. Wells, Sdence 243, 1330 (1 989); B. C. Cunningham and J. A. Wells, Science 

244, 1081 (1989), 15, 16) to modulate the binding of the extracellular domains of the hGH and/or hPRL 
receptors (called hGHbp and hPRLbp, respectively). Cassette mutagenesis was carried out essenliany as 
described [J. A. Wells, M. Vasser, D. B. Powers, Gene 34. 315 (1985)]. This library was constructed by cassette 
mutagenesis that fully mutated four residues at a time (see Example VI II) which utilized a mutated version of 

4 0 phGHam-g3 into which unique Kpd (at hGH codon 27)and Xho\ (at hGH codon 6) restriction sites (underlined 

befow) had been inserted by mutagenesis [ T, A. Kunke!. J. D. Roberts. R. A. Zakour, Me^s Enzymot. 154, 367- 
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382] with the otigonucieotides 5'-GCC TTT GAG A GG TAG CA G GAG TIT G-3' and 5'-CCA ACT ATA GCA 
CT C TCQ AG G TCT ATT CGA TAA C-3', respectively. The later oiigo also introduced a +1 frameshift 
{italicized) to terminate translation from the startir>g vector and minimize wild-type background in the phagemid 
library. This strating vector was designated pH0508B. The helix 1 library, which mutated hGH residues 10, 14, 
5 18, 21 , was constructed by ligating to the large Xho\'Kpri\ fragment of pH0508B a cassette made from the 
complementary oltgonucteotides 5 -pTCG AGG CTC KINS GAG AAG GCG ms CTG CGT GCT NNS CGT OTJ 
NNS CAG CTG GCC TTT GAC AGG TAG-3' and 5'i>GT GTC AAA GGC CAG CTG SNN AAG AGG SNN AGG 
AGG CAG SNN GGC GTT GTC SNN GAG CC-3'. The Kpn\ site was destroyed 'w\ the junction of the ligation 
product so that restriction enzyme digestion could be used for analysis of non-mutated background. 

10 The library contained at least 1 0^ independent transfom\ants so that if the library were absolutely 

random (1 0^ different combinations of codons) we would have an average of about 1 0 copies of each possible 
mutated hGH gene. Restriction analysis using Kpni indicated that at least 80% of heiix 1 library constmcts 
contained the inserted cassette. 

Binding enrichments of hGH-phage from the libraries was carried out using hGHbp immobiltzed on 

1 5 oxirane-pdyacrylamide beads {Sigma Ghemicat Go.) as desaribed (Example Vlli). Four residues in helix 1 {F10, 
M14, H18, and H21) were similarly mutated and after 4 and 6 cycles a non-wikJ-type consensus developed (Table 
VIII). Position 10 on the hydrophobk; face of heiix 1 tended to be hydrophobic whereas positions 21 and 18 on the 
hydrophilltc face tended were dominated by Asn; no obvwus consensus was evktent for position 14 (Table tX). 
The binding constants for these mutants of hGH to hGHbp was determined by expressing the free 

2 0 hormone variants in the non-suppressor £ coli strain 1SC9, purifying the protein, and assaying by competitive 
displacement of labelled wt-hGH from hGHbp (see Example VIH). As indicated, several mutants bind tighter to 
hGHbp than doeswt-hGH. 
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Table VIII. 
Selection of hQH helix 1 mutants 

Variants of hGH (ranck>miy mutated at residues FIO, M14, HIS, H21) expressed on phagemid 
5 particles were selected by landing to hGHbp-beads and eluting with hGH {0.4 mKf) buffer 
followed by glycine (0.2 pH 2) buffer (see Example VIII), 
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Tabie IX 

Consensus sequerx:es torn the selected he^ 
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Observed frequency is fraction of atl clones sequenced with the indicated amino acid. The nominal frequency is 
cakxjiated on the tiasis of NNS 32 codon degeneracy. The maximal enrichment factor varies from 1 1 to 32 
depending upon the nominal frequency value for a given residue. Values of [K<j{Ala mut)/K(j(wt hGH)] for single 
alanine mutations were taken from B. C. Cunningham and J. A. Weils, Science 244, 1081 (1989); B. C. Cunningham 
D. J. Henner, J. A. Weils, Sdence 247, 1461 (1990); B. C. Cunningham and J. A. Wells, Proc. Nati Acad Sd USA 
88, 3407 (1991). 
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Table X 

Blndng of purtfied hGH he&x 1 mutants to hGHbp 

5 Competition binding experiments were performed using ^^hGH (wild-type), hGHbp (containing the 

extracellular receptor domain, residues 1-238), and Mab263 [B. C. Cumingham, P. Jhurani, P. Ng J A Wells 
Science 243, 1330 (1989)];. The number P indicates the fractionai occurrence of each mutant among all the clones 
sequenced after one or more rounds of selection. 
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EXAMPLE X 

SELECTION OF hGH VARIANTS FROM A HEUX^ RANE)OM CASSETTE LIBRARY CONTAINING 
PREVIOUSLY FOUND MUTATIONS BY ENRICHMENT OF HORMONE-PH AGE 

30 Pes^gn gf nurtant OTWhs wKh Im&roved binding properties bv ftefativA s efecf on usf ng homionp^ gfi 

Our experience with recruiting non-binding homologs of hGH evolutionary variants suggests that many 
individual amino acid substitutions can be combined to yield cumulatively improved mutants of hGH with respect to 
binding a particular receptor [B. C. Cunningham, D. J. Henner, J. A. V^^ells. Sdence 247, 1461 (1990); B. C. 
Cunningham and J. A. Weils, Proc. Natl. Acad Sd. USA 88, 3407 (1991 ); H. B. Lowman, B. C. Cunningham. J. A. 

35 VJeWs, J. BioL Chem. 266, in press (1991)]. 

The helix 4b library was constructed in an attempt to further improve the helix 4 double mutant 
(E174S/F176Y) selected from the helix 4a library that we found bound tighter to the hGH receptor (see 
Example VIII). With the E174S/F1 76Y hGH mutant as the background starting hormone, residues were mutated 
thatsun-ounded positions 174and 176 on the hydrophilic face of heiix4(R167, D171,T175and 1179) . 

40 C^hStrucCon of the helix 4b library bv ms^hp mtrtflflfrnts 

Cassette mutagenesis was carried out essentially as described [J. A. Wells, M. Vasser, D. B. Powers, 
Gene 34, 31 5 (1 985)]. The helix 4b library, which mutated residues 1 67, 1 71 . 1 75 and 1 79 within the 
E174S/F176Y background, was constructed using cassette mutagenesis that fully mutated four residues at a 
time (see Example VIII) and whteh utilized a mutated version of phGHam-g3 into which unique BsfEl and Sg/li 
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restriction sites had been inserted previousiy (Example Vltl). hito the BstB\'&gf\\ sites of the vector was inserted 
a cassette made from ^ complementary oligomjcieotides 5'-pG TTA CTC TAC TGC TIC NNS AAG GAC ATG 
NNS AAG GTC AGC NNS TAC CTG CGC NNS GTG CAG TGC A-3' and 5 -pGA TCT GCA CTG CAC SNN 
GCG CAG GTA SNN GCT GAC CTl SNN CAT GTC CTT SNN GAA GCA GTA GA-3'. The BstB\ site was 
5 eliminated in the ligated cassette. From the helix 4b library, 15 unselecled clones were sequenced. Of these, none 
lacked a cassette insert, 20% were frame-shifted, and 7% had a non-s8ent mutation. 

Results gf hGHbp enrlcnmeftt 

Binding enhctoents of hGH-phage from the libraries was carried out using hGHbp immobilized on 
oxirane-pofyacryiamlde beads (Sigma Chemical Co.) as described (Example Vlll). After 6 cycles of binding a 
1 0 reasonably dear consensus developed (Table XI). interestingly, alt positions tended to contain polar residues, 
notably Ser, Thr and Asn (XII). 

AssgygfhSH mutants 

The binding constants for some of these mutants of hGH to hGHbp was determined by expressing the 

free hormone variants in the non-suppressor £ coii strain 16C9, pxjrifying tf)e protein, and assaying by competitive 
1 5 displacement of labelled wt-hGH from hGHbp {see Example VIII). As indicated, the binding affinities of several 

helix-4b mutants for hGHbp were tighter than that of wt-hGH Tat>le XIH). 

Rffi&ty-Sfilectlvlly pf hGH variants 

Rnaliy, we have begun to analyze the binding affinity of several of the tighter hGHbp binding mutants 

for their ability to bind to the hPRLbp. The E174S/F176Y mutant binds 200-fold weaker to the hPRLbp than 
20 hGH. The E174T/Fl76y/R178K and R167N/D171S/E174S/F176Y/I179T mutants each bind >500-fold weaker 

to the hPRLbp than hGH. Thus, it is possible to use the produce new receptor selective mutants of hGH by phage 

display technology. 

Honpone-Dhaaemid selection Identifies the infonnatton-CQntent of pa rticular residues 

Of the 12 residues mutated in three hGH-phagemid libraries (Examples Vlll, IX, X), 4 showed a strong, 

25 al^ough not exclusive, conservation of the wild-type residues (K172, T175, F176, and R178). Not surprisingly, 
these were residues that when converted to Ala caused the largest disruptions (4- to 60-foid) In binding affinity 
to the hGHbp. There was a class of 4 other residues (F10, M14, D171, and 1179) where Ala substitutions caused 
weaker effects on binding (2- to 7-fokj) and these positions exhibited little wild-type consensus. Rnally the other 
4 residues (H18, H21. R167, and E174), that promote binding to he hPRLbp but not tiie hGHbp, did not exhibit 

30 any consensus for the wikJ-type hGH sequence by selection on hGHbp-beads. In fact two residues (El 74 and 
H21), where Ala substitutions enhance binding affinity to the hGHbp by 2- to 4-foid (B, C. Cunningham, P. 
Jhurani, P. Ng. J. A. Weils, Science 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 244, 1081 
(1989); B. C. Cunningham, D. J. Henner, J. A. Wells, Science 247, 1461 (1990); B. C. Cunningham and J. A. Wells, 
Proc. Nat Acad Sd. 3407 (1991)). Thus, the alanine-scanning mutagenesis data correlates reasonably 

35 well with the flexibility to substitute each position. In fact . the reduction in binding affinity caused by alanine 
substitutions [B. C. Cunningham, P. Jhurani, P. Ng, J. A, Wells, Sdence 243, 1330 (1989); B. C. Cunningham and 
J. A. Wells, Science 244, 1081 (1989)J, B. C. Cunningham, D. J. Henner. J. A. Wells, Science 247, 1461 (1990); B. 
C. Cunningham and J. A. Wells, Proc. Natl. Acad. Sd, USA 88, 3407 (1991)] is a reasonable predictor of the 
percentage that the wild-type residue is found in the phagemid pool after 3-6 rounds of selection. The alanine- 
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scanning information is useful for targeting side-chains that modulate binding, and the phage selection is 
appropriate for optimizing them and defining the flexibtltty of each site (and/or combir^ations of sites) for 
substitution- The combination of scanning mutational methods [B. C. Cunningham, P. Jhurani, P. Ng, J. A. Wells, 
Science2AZ, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 244, 1081 (1989}] and phage display is a 
5 powerful approach to designing receptor-ligand interfaces and studying molecular evolution in vitro. 
VarfatiQris on iterative enrichmem of hornxanej)^^ 

in cases where combined mutations in hGH have additive effects on binding affinity to receptor, 
mutations teamed through hormone-phagemid enrichment to improve binding can be combined by simple cutting 
and ligation of restriction fragments or mutagenesis to yteid cumida^ly op^mized mutants of hGH. 

10 On the other hand, mutations in one region of hGH which optimize receptor binding may be structurally or 

functionally incompatible with mutations in an overiapping oc another region of the molecule. In these cases, 
hormone phagemid enrichment can be carried out by one of several variations on the Iterative enridiment approach : 
(1) random DMA libraries can be generated in each of two (or perhaps more) regions of the molecule by cassette 
or another mutagenesis method. Thereafter, a combined library can be created by ligation of restriction 

1 5 fragments from the two DNA libraries; (2) an hGH variant, optimized for binding by mutation in one region of the 
molecule, can be randomly mutated in a second region of the molecule as in the hel!x-4b library example; (3) two or 
more random libraries can be cacMy selected for improved binding by hormone-phagemid enrichment; after this 
"roughing-in" of the optimized binding site, the stiil-partiatly-diverse libraries can be recombined by ligation of 
restriction fragments to generate a single library, partially diverse in two or more regions of the molecules, which 

20 in turn can be further selected for optimized binding using hormone-phagemid enrichment. 
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Table Xi. 

Mut^ phagemicte of hGH selected from helbc 4b li^^ 

Selection of hGH helix 4b mutants {randomly mutated at residues 167, 171, 175, 179), each containing the E174S/F176Y 
double mutant by binding to hGHbp-beads and eluting with hGH (0.4 mM) buffer followed by glycine (0.2 M, pH 2) buffer. 
5 One mutant (+) contained the spurious mutation R1 78H, 
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Table Xit 

Consersus sequerces ^ the selected yb»^ 

Observed frequency is fraction of all ciones sequenced with the indicated amino add. The nominal frequency 
is calculated on the basis of NNS 32 codon degeneracy. The maximal enrichment factor varies from 11 to 16 to 32 
depending upon the nominal frequency value for a given residue. Values of [Kd(Ala mut)/Kd(wt hGH)] for single 
alanine mutations were taken from refs. below; for position 1 75 we only have a value for the T1 75S mutant [B. C. 
Cunningham, P. Jhurani. P. Ng. J. A. Wells, Science 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 244, 
1081 (1989); B, C. Cunningham, D. J. Henner, J. A. Wells, Science 247. 1461 (1990); B. C. Cunningham and J. A. Wells, 
Proc. Natl, Acad, Sci, USA 86, 3407 (1991),]. 
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Table XIII 

Binding of purified hGH mutants to hGHbp. 

Competition binding experiments were perfomied using (125ijhGH (wild-type), hGHbp (containing the 
extracellular receptor domain, residues 1-238). and Mab263 (11). The number P indicates the fractbnal 
occurrence of each mutant among aN the clones sequenced after one or more rounds of selection. Note that the 
helix 4b mutations (*) are in the background of hGH(E174S/F176Y). In the list of helix 4b mutants,, the 
E174S/F176Y mutant (*), with wt residues at 167. 171, 175, 179, is shown in bold. 
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Assenrribty of Moiecule on the Phagernki Surtax 

ConstructtongfptemMs 

Plasmid pDH 188 contains the DNA encoding the Fab portion of a humanized IgG antibody, cailed 4D5, 
5 that recognizes the HER-2 receptor. This plasmid is contained in E. co* strain SR 101 , and has teen deposited 
with the ATCC in Rockville, MD. 

Briefly, the plasmid was prepared as follows: the starting plasmid was pS0132, containing the alkaline 
phosphatase promoter as described above. The DNA encoding human growth homfione was excised and, after a 
series of manipulations to make the ends of the plasmki compatit)le for ligation, the DNA encoding 4D5 was 

1 0 inserted. The 4D5 DNA contains two genes. The first gene encodes the variable and constant regk>ns of the light 
chain, and contains at its 5' end ttie DNA encoding the st II signal sequence. The second gene contains four 
portions: first, at its 5* end is the DNA encoding the st II signal sequence. THs Is followed by the DNA encoding 
the variable domain of tfie heavy chain, which is followed by the DNA encoding the first domain of the heavy chain 
constant region, which in turn is foltowed by the DNA encoding the M13 gene III. The salient features of this 

1 5 construct are shovel in Ftgure 10. The sequence of the DNA encoding 4D5 is shorn in Figure 1 1 . 
£ cof/transfofmation and phage production. 

Both polyethylene glycol (PEG) and electroporation were used to transform plasmids into SR101 cells. 
(PEG competent cells were prepared and transformed according to the method of Chung and Miller (Nucleic 
Acids Res. 16:3580 [1988]). Cells that were competent for electroporation were prepared, and subsequently 

2 0 transformed via electroporation according to the method of Zabarovsky and Winberg [Nucleic Adds ffes. 1 8 :59 1 2 
[1990]). After placing the cells in 1 ml of the SOC media (described in Sambrook etai, supra), they were grown 
for 1 hour at 37^C with ^king. At this time, the concentration of the cells ms determined using light scattering 
at ODgoo. A titered K07 phage stock was added to achieve an multiplicity of infection (MOt) of 100, and the 
phage were aitowed lo adhere to the cells for 20 minutes at room temperature. This mixture was then diluted into 

2 5 25 mis of 2YT broth (described in Sambrook et aA, supra) and incubated with shaking at Zl^'O overnight. The 
r>ext day, cells were pelleted by centrifugalion at 5000 x g for 10 minutes, the supernatant was collected, and the 
phage particles were precipitated with 0.5 M NaCI and 4% PEG (final concentration) at room temperature for 10 
minutes. Phage particles were pelleted by centrifi^ation at 10,000 x g for 10 minutes, resuspended in 1 ml of TEN 
(10 mM Tris, pH 7.6» 1 mM EDTA, and 150 mf^ NaCi). and stored at 4X. 

30 Productkffl of antigen coated plate^ 

Aik^uots of 0.5 ml from a solution of 0.1 mg/ml of the extra-cellular domain of the HER-2 antigen (ECD) 
or a solution of 0.5 mg/ml of BSA (control antigen) in 0.1 M sodium bfcarbonate, pH 8.5 were used to coat one well 
of a Falcon 12 tissue culture plate. Once the solution was applied to the wells, the plates were incubated at 
4°C on a rocking platform overnight The plates were then bk)cked by removing the initial solution, applying 0.5 ml 

35 of blocking buffer (30 mg/ml BSA in 0.1 M sodium bicartonate), and incubating at room temperature for one hour. 
Finally, the blocking buffer was removed, 1 ml of buffer A (PBS, 0.5% BSA, and 0.05% Tween-20) ms added, 
and the plates were stored up to 10 days at 4*C before being used for phage selection. 
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Phage sgfetfon process, 

Approximately 10^ phage partides were mixed with a lOO-foki excess of K07 helper phage and 1 m\ of 
buffer A . This mixture was divided into two 03 ml aliquols; one of which was applied to ECD coated wells, and 
the other was applied to BSA coated wells. The plates were Inatoted at room temperature while shaking for one 
5 to three hours, and were then washed three times over a period of 30 minutes with 1 mi aliquots of buffer A. 
Elution of the phage from the plates was done at room temperati^ by one of two methods: 1) Bn initial overnight 
incubation of 0.025 mg/ml purified Mij4D5 antibody (murine) followed by a 30 minute incubation with 0.4 ml of 
the add elution buffer {0.2 M glycine, 2.1 , 0.5% BSA, and 0.05% Tween-20). or 2) an incubation with the acid 
elution buffer alone. Ekiates were then neutralized with 1 M Tris base, and a Oi ml aliquot of TEN was added. 
1 0 These samples were then propagated, litered, and stored at 4^. 
Phaggprepacatten 

Aliquots of eiuted phage were added to 0.4 ml of 2YT broth and mixed with approximately 10^ mid-log 
phase mate E co// strain SR101. Phage were allowed to adhere to the cells for 20 minutes at room temperature 
and then added to 5 ml of 2YT broth that contained 50 ^ml of carbenicillin and 5 jxg/ml of tetracycline. These 

1 5 cells were grown at 37*C for 4 to 8 hours until they reached mid-log phase. The ODeoo was determined, and the 
cells were superinfected with K07 helper phage for phage production. Once phage particles were obtained, ttiey 
were titered in order to determine the number of colony fomiing units (cfu). This was done by taking aliquots of 
serial dilutions of a given phage stock, allowing them to infect mid-log phase SR1 01 , and plating on LB plates 
containing 50 Dg/ml carbenicillin. 

20 RIAaffinltv determination. 

The affinity of h4D5 Fab fragments and Fab phage for the ECD antigen was determined using a 
competitive receptor binding RIA (Burt, D. R., Receptor Binding in Dnjg Research, CBrien. R.A. (Ed.), pp. 3- 
29, Dekker, New York |1986]). The ECD antigen was labeled with ^^^.^^jne using the sequential chloramine-T 
method (De Larco, J. E. etai, J. CeiL Physiol. 109:143-152 11981)) which produced a radioactive tracer with a 

25 specific activity of l4^Ci/^ and incorporation of 0.47 moles of Iodine per mole of receptor. A series of 0.2 ml 
solutions containing 0.5 ng (by ELISA) of Fab or Fab P^^Se, 50 nCi of ^^^i ecd tracer, and a range of unlabeled 
ECD amounts (6.4 ng to 3277ng} were prepared and incubated at room temperature overnight. The labeled 
ECD-Fab or ECD-Fab phage complex was separated from the unbound labeled antigen by forming an aggregate 
comptex induced by the addition of an anti-human IgG (Fitzgerald 40-GH23) and e% PEG 8000. The complex was 

3 0 pelleted by centritugalion (1 5,000 x g for 20 minutes) and the amount of labeled ECD {in cpm) was determined 
by a gamma counter. The dissociation constam (Kd) v«is calcinated by emptoying a modified version of tiie 
program LIGAND (Munson, P. and Rothbard, D., Anat. Biocbem, 107220-239 11980J) which utilizes Scatchard 
analysis (Scatchard, G. NX Acad Sd, 51.-660-672 [19491). The Kd values are shown in Figure 13. 
COTpetftrvewH Wrnlng fum 

35 Murine 4D5 antibody was labeled with 125-1 to a specific activity of 40-50 nCi/^ig using the fodogen 

procedure. Soluttons containing a constant amount of labeled antibody and increasing amounts of unlabeled variant 
Fab were prepared and added to near confluent cultures of SK-BR-3 cells grown m 96-wefl microtiter dishes 
(final concentration of labeled antibody was 0.1 nM). After an overnight incubation a! 4**C, the supernatant was 
removed, the cells were washed and the cell associated radioactivity was determined in a gamma counter. Kd 



values were determined by analyzing the data using a modified version of the program LfGAND (Munson, P. and 
Rothbard, D., supra) 

This deposit of plasmid pDH188 ATCC no. 68663 was made under the provisions of the Budapest 
Treaty on the tntemafionat Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure 
5 and the Regulations thereunder (Budapest Treaty). This assures maintenance of a viable culture for 30 years 
from the dale of deposit. The organisms will be made available by ATCX^ under the terms of the Budapest 
Treaty, and subject to an agreement between Genentech, inc. and ATCC, which assures pemianent and 
unrestricted availability of the progeny of the cultures to the public ^X)n issuance of the pertinent U.S. patent or 
upon laying open to the pubTic of any U. S. or foreign patent application, whichever comes first, and assures 

1 0 availability of the progeny to one determined by the U. S. Commissioner of Patents and Trademarks to be entitled 
thereto according to 35 USC §122 and the Commisaoner's rules pursuant thereto (including 37 CFR §1.14 with 
particular reference to 686 OQ 638). 

The assignee of the present application has agreed that if the cultures on deposit should die or be lost 
or destroyed when cultivated under suitable conditions, they will be promptly replaced on notification vwth a viable 

1 5 specimen of the same culture. Availability of ttw deposited cultures is not to be construed as a license to practice 
the invention m contravention of the rights granted under the authority of any government in accordance with its 
patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the cultures deposited, since the 

20 deposited embodiments are intended as separate illustrations of certain aspects of the invention and any cultures 
that are functionally equivaleni are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition to 

25 those shovm and described herein will become apparent to those skilled in the art from the foregoing description 
and fall v«thin the scope of the appended datms. 

While the invention has necessarily been described in conjuncfion with preferred embodiments, one of 
ordinary skill, after reading the foregoing specification, will be able to effect various changes, substitutions of 
equivalents, and alterations to the subject matter set forth herein, without departing from the spirit and scope 

30 thereof. Hence, the inventton can be practiced in ways other than those spedficalty It is 

therefore intended that tt>e protectk)n granted by Letters Patent hereon be limited only by the appended claims 
and equivalents thereof. 
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EXAMPLE Xil 

SELECTION OF hGH VARIANTS FROM COMBINATIONS OF HEUX-1 AND HEUX4 HORMONE^PHAGE 

VARIANTS 

5 Construcflon of additf ve variants of hGH 

According to additivity principles [J. A. Wells, Biochemistry 29, 8509 (1990)1, mutations in different 
parts of a protein, if they are not mutually interacting, are expected to combine to produce additive changes in the 
free energy of binding to another molecule (changes are additive in terms of MGbinding- ^ multiplicative in 
terms of = exp(-AG/RT] ). Thus a mutation producing a 2-foid increase in binding affinity, when combined 

10 with a second mutation causing a 3-fold increase, would be predicted to yield a double mutant with a 6-fo[d 
increased affinity over the starting variant 

To test whether multiple mutations obt^ned from hGH-phage selections would produce cumulatively 
favorable effects on hOHbp (hGH-binding protein; the extracellular domain of the hGH receptor) binding, we 
combined mutations found in the three tightest-binding variants of hGH from me helix-l library (Example IX: 

1 5 F10A/MUW/H18D/H21N, F10H/M14G/H18N/H21N, and F10F/MUS/H18F;H2tL) with those found in the 
three tightest binding variants found in the heiix-4b library (Example X: R167N/D171 8^-175/11791, 
R167eD171S/T175/t179, and R167N/D171N/T175/I179T). 

hGH-phagemtd double-stranded DNA (dsDNA) from each of the one-hetix variants was isolated and 
digested with the restriction enzymes EcoR\ and BstXl. The large fragment from each heiix-4b variant was then 

20 isolated and ligated with the small fragment from each helix-1 variant to yield the new two-helix variants shown in 
Table XIIL All of these variants also contained the mutations E174S/F176Y obtained in earlier hGH-phage binding 
selections (see Example X for details). 
Constructfon of selective combinatorial itbrartes of hGH 

Although additivity principles appear to hold for a number of combinations of mutations, some 

25 combinations (e.g. E174S with F176Y) are dearly non-additive (see examples Vili and X). In order to identify 

with certainty the lightest binding variant with, for example, 4 mutations in helix-1 aM 4 mutations in helix-4, one 
would ideally mutate all 8 residues at once and then sort the pool for 8ie globally tightest binding variant 
However, such a pool would consist of 1.1 x 10^2 qj^ja sequences (uh'lizing NNS codon degeneracy) encoding 2.6 x 
10^^ different polypeptides. Obtaining a random phagemid library large enough to assure representation of all 

30 variants (perhaps 10^3 ^gpsformants) is not practical using current transformation tedrology. 

We have addressed this difficulty first by utilizing succesave rounds of mutagenesis, taking the 
tightest binding variant from one library, then mutating other residues to further improve binding (Example X). 
In a second method, we have utilized the principle of additivity to combine the best mutations from two 
independently sorted libraries to create multiple mutants wth improved binding (described ^ve). Here, we 

35 further searched through the possible combinations of mutations at positions 10, 14, 18, 21, 167, 171, 175, and 
179 in hGH, by creating combinatorial libraries of random or partially-random mutants. Y/e constructed three 
different comttnatorial libraries of hGH-phagemids, using the pooled phagemids from the helix 1 library 
(independently sorted for 0. 2, or 4 cycles; Example IX) and the pool from the helix-4b library (independently 
sorted for 0. 2, or 4 cycles; Example X) and sorted the combined variant pool for hGHbp binding. Since some 

40 amount of sequence diversity exists in each of these pools, the resulting combinatorial lilxary can explore more 
sequence combinations than what we might construct manually (e.g. Table Xlil). 
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hGH-phapemki doubte-stranded DNA (dsDNA) from each of the one-helix library poois (selected for 0, 
2, or 4 rounds) was isolated and digested with the restriction enzymes Acci and BstX\. The large fragment from 
each he)ix-l variant pool was then isolated and iigated with the small fragment from each helix-4b variant pool to 
yield the three combinatorial libraries pH0707A (mselected helix 1 and helix 4b pools, as described in examptes !X 
5 and X), pH0707B (twice-selected helix-1 pool with tvwce-sefected helix4b pool), and pH0707C (4-times 

selected helix-1 pool with 4-times selected heltx-4b pool). Duplicate ligations were also set up with less DNA and 
desisted as pW707D, pH0707E, and pH07Q7F. corresponding to ^ 0%2-, and 4-round starting libraries 
respectively. All of these variant poofs also contained the mutations E174S/F176Y obtained in earlier hGH-phage 
binding selections (see Example X for details). 

10 Sortinq combmatortal libraries of hGH>phaqe variams 

The ligation products pH0707A-F were processed and electro-transformed into XLI-Biue cells as 
described (Example Vlli). Based on colony-forming units (CFU), the number of transformanls obtained from each 
pool was as follows: 2.4x10® from pH0707A, 1.8x10^ from pH0707B, 1.6x10^ from pH0707C, 8x10^ from 
pH0707D, 3x1 o5 from pH0707E, and 4x10^ from pH0707F. hGH-phagemid particles were prepared and 

1 5 selected for hGHbp-binding over 2 to 7 cycles as described in Example VIII. 
Rapid sortinq pt hGH-phsggml d B&rarigs 

In addition to sorting phagemid libraries for tight-binding protein variants, as measured by equilibrium 
binding affinity, it is of interest to sort for variants which are altered in either the on-rate (kon) or the off-rate 
(koff) of binding to a receptor or other molecule. From thermodynamics, these rates are related to ihe 

20 equilibrium dissociation constant. K<j = (koff/kon). We envision that certain variants of a particular protein have 
simitar Kd's for binding while having very different kon's and koff's. Conversely, changes in from one variant to 
another may be due to effects on kon. effects on koff, or both. The pharmacological properties of a protein may 
be dependent on binding affinity or on kon or koff, depending on the detailed mechanism of action. Here, we sought 
to kientify hGH variants with higher on-rales to investigate the effects of changes in kon- We envision that the 

2 5 selection could alternatively be weighted toward koff by increasing the tending time and increasing the wash time 

and/or concentration witti cognate ligand (hGH). 

From time-course analysis of wild-type hGH-phagemid binding to immobilized hGHbp, it appears that, 
of the total hGH-phagemid partk;les that can be eluted in the final pH 2 wash (see Example VIII for the complete 
binding and eiution protocol), less than 10% are bound after 1 minute of incubalbn, while greater than 90% are 

3 0 bound after 1 5 minutes of incubatbn. 

For "rapid-binding selection,* phagemid particles from tfie pH0707B pool (twice-selected for helices 1 
and 4 independently) were incubated with immobilized hGHbp for only 1 minute, then washed six times with 1 mL of 
binding buffer; the hGH^ash step was omitted; and the remaining hGH-phagemid particles were eluted with a 
pH2 (0.2M glycine in binding buffer) wash. Enrichment of hGH-phagemid parades over non-displaying particles 
35 indicated that even with a short binding perkxJ and no cognate-iigand (hGH) challenge, hGH-phagemid binding 
selection sorts tight-binding variants out of a randomized pool 
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Assay gfhGH mutants 

The binding constants for some of these mutants of hGH to hGHbp was determined by expressing the 
free hormone variants in the non-suppressor £ co// strain 16C9 or 34B8, purifying the protein, and assaying by 
competitive displacement of labelled wt-hGH fn>m hGHbp (see Example Vll!) in a radio-immunoprectf^tation assay, 
5 ^ Table Xtll -A below, alt the variants have glutamatei74 replaced by serinei74 and phenyiaianinei 76 replaced by 
tyrosine 176 (E174S and F1176Y) plus the additional substitutions as indicated at hGH amino add positions 1 0, 
14,18. 21J67,17lJ75andl79. 
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Table Xltl-A 

hGH variants from addtion of belbc*1 and h^x4b mutatfons 

Heibfl Helix 4 






wild'type residue: 


m 


M14 


Hig 


H21 


filSZ 


mil 


T175 


JI 




Variant 


















15 


H0650AD 


H 


G 


N 


N 


N 


s 


T 


T 




H0650AE 


H 


G 


N 


N 


E 


s 


T 


1 




H0650AF 


H 


G 


N 


N 


N 


N 


T 


T 




H0650BD 


A 


W 


D 


N 


N 


s 


T 


T 




H0650BE 


A 


W 


D 


N 


E 


s 


T 


1 


20 


H0650BF 


A 


w 


D 


N 


N 


N 


T 


T 




H0650CD 


F 


s 


F 


L 


N 


s 


T 


T 




H0650CD 


F 


s 


F 


L 


E 


s 


T 


1 




H0650CD 


F 


s 


F 


L 


N 


N 


T 


T 



25 in Table XtV below, hGH variants were selected from combinatorial libraries by the phagemid binding 

selection process. Ail hGH variants in Table XIV contain two background mutations (E174S/F176Y), hGH- 
phagemid pools from the libraries pH0707A (Part A), pH0707B and pH0707E (Part B), or pH0707C (Part C) 
were sorted for 2 to 7 cycles for binding to hGHbp. The number £ indicates the fractiona! occurrence of each 
variant type among the set of clones sequenced from each pool. 
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hGH variants from homx)n^^^ 



5 wild-type residue; 



librarfes. 



10 



15 



20 



25 



30 



35 



40 



Part A : 
0.60 

0.40 
Pan B : 

0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 
0.13 

0.17 
0.17 
0.17 
0.17 
0.17 
017 

0.57 
0.14 
0.14 
0.14 
0.50 
Pan C: 

0.67 



Variant 
4 cycles: 
H0714A.1 

H0714A.4 

2cydes: 

H0712B.1 

H0712B.2 

H0712B.4 

H0712B.5 

H0712B.6 

H0712B.7 

H0712B.8 

H0712E.1 

H0712E.2 

H0712E.3 

H0712E.4 

H0712E.5 

H0712E.6 

H0712E.7 

H0712E.8 

4 cycles: 

H0714B.1 

H0714B.2 

H0714B.3 

H0714B.4 

H0714B.5 

H0714B.6 

7cydes: 

H0717B.1 

H0717B.2 

H0717B.6 

H0717B.7 

H0717E.1 

4 cycles: 
H0714C.2 



M l 



MIA m m 



H 
A 



F 

H 

H 

F 

A 

Q 

W 

F 

W 

A 

P 

H 

F 

M 

F 

A 
A 
M 

H 
L 
L 

A 
F 
A 
I 

F 



G 
N 



S 

Q 

G 

S 

S 

Y 

G 

L 

N 

N 

S 

G 



N 
D 



F 
T 
N 
F 
T 
N 
S 

s 

N 
A 
D 
N 



s 


T 


G 


T 


S 


N 


S 


F 


L 


w 


D 


N 


w 


D 


N 


Q 


M 


N 


Y 


D 


H 


N 


S 


H 


N 


S 


H 


W 


D 


N 


s 


T 


G 


w 


D 


N 


Q 


E 


H 


S 


L 


A 


s 


F 


L 



N 
A 



G 

S 

N 

L 

N 

N 

S 

S 

S 

s 

N 



om im m 



N ' 



T 
S 
T 
T 
I 

T 
i 

V 
T 
T 

I 

S 

I 

T 
S 



N 


N 


T 


H 


S 


T 


A 


D 


N 


N 


A 


T 


S 


D 


T 


R 


D 


T 


H 


S 


T 


R 


D 


T 


K 


N 


T 


H 


S 


T 


N 


S 


T 


R 


D 


T 


N 


H 


T 


R 


D 


T 


Q 


S 


T 


T 


s 


T 



R 
H 
N 
R 
R 
T 

N 
R 
R 
N 
N 



D 
S 
S 
D 
D 
S 

A 

D 
D 
S 
S 



T 
T 
T 
T 
T 
T 

T 
T 
T 
T 
T 



N 
T 
T 
I 

T 



T 
V 
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= also contained the mutations L15R, K168R. 



=0 ~o»^^«=..r~~:rzz- 
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Table XV 

hGH valiants from RAPID hGHbpbbxttngse^^ library 



MLl Helix 4 



wild-type 


residue: 
Variant 


Elfl 


M14 


H18 


1121 


fil£Z 


D171 


im 


m 


0.14 


H07BF4.2 




G 


S 


S 


R 


D 


T 


1 


0,57 


H07BF4.3 


M 


A 


D 


H 


N 


S 


T 


T 


0.14 


H07BF4.6 


A 


W 


D 


N 


S 


S 


V 


Tt 


0.14 


H07BF4J 


H 


Q 


T 


S 


R 


D 


T 


1 



t = also contained the mutation Y176F (wild-type hGH also contains F176). 



In tatrfe XVI below, binding constants were Tneasured by competitive displacement of "'^^l-iabelled 
hormone H0650BD or labelled hGH using hGHbp (1-238) and either MabS jot Wab263. The vanant H0650BD 
appears bind more than 30-fold tighter than wild-type hGH. 
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TabteXVJ 

EqJQbrlum binding constants of seiec^ 



15 



hGH 


Kd(Yartant). 


Mvariant) 




Variant 


Kd(H0650BD) 


Kd{hGH) 


Kd (pMi 


hGH 


32 


-1- 


340 i 50 


H0650BD 


-1- 


0.031 


10± 3 


H0650BF 


15 


0,045 


15± 5 


H0714B.6 


3.4 


0.093 


34i 19 


H0712BJ 


7.4 


0.22 


74i 30 


H0712E.2 


16 


0.48 


60i70 



20 

EXAMPLE Xlii 

Setectrve ennchment of hGH-phage containing a protease substrate sequence versus non-sii)strate phage 

25 As described in Example [, the piasmid pS0132 contains the gene for hGH fused to the residue Pro198 of the gene 
ill protein with the insertion of an extra glycine residue. This piasmid may be used to prodLX^ hGH-phage particles 
In which the hGH-gene lii fusion product is displayed monovatently on the phage surface (Example ! V). The fusion 
protein comprises the entire hGH protein fused to the cartx)xy terminal domain of gene Itl via a flexible linker 
sequence. 

30 To investigate the feasibility of using phage display technology to select favourable substrate 

sequences for a given proteolytic enzyme^ a genetically engineered variant of subttlisin BPN' was used. (Carter, P. 
et al.. Proteins: Structure, function and genetics 6:240-248 (1989)). This variant (hereafter refen^ed to as 
A64SAL subttlisin) contains the following mutations: Ser24Cys, His64Ala, GlulBGSer, Giy169Aia and 
Tyr217Leu. Since this enzyme lacks the essential catalytic residue His64, its substrate spedfictty is greatly 

35 restricted so that certain histidine-containing substrates are preferentially hyrdrolysed (Carter et aL, Science 
237:394-399 (1987)). 
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ConstriJCtiQn of a hGH^bstrate-phaae vector 

The sequence of the Drker region in pS0l32 was mutated to create a substrate sequence for A64SAL 
subtilisin, using the oligonucleotide 5'-TTC-GGG-CCC-TTC-GCT-GCT-CAC-TAT-ACG-CGT-CAG-TCG- 
ACT-GAC-CTG-CCT-3'. This resulted in the irrtroduction of the protein sequence Phe-Gly-Pro-Phe-Ala-AIa- 
5 His-Tyr-Thr-Arg-Gln-Ser-Thr-Asp in the linker region between hGH and the cartwxy terminal domain of gene iii, 
where the first Phe residue in the above sequence is Phe191 of hGH. l\\e sequence Ala-Ala-His-Tyr-Thr-Agr-GIn 
is known to be a good substrate for A64SAL subtiltsin (Carter et al (1989). supra). The resulting plasmid was 
designated pS0640. 

10 Selective enrichnfient of hGH^s ubstrat^phapfi 

Phagemid particles derived fronn pS0132 arvd pS0640 were constructed as described 
in Exanrpie I. In initial experiments, a 10^1 aliquot of each phage poo! v^^as separately niixed with 
30|il of oxirane beads (prepared as described in Example II) in lOOjil of buffer comprising 
20mM Tris-HCi pH 8.6 and 2.5M NaCl. The bindirig and washing steps were pertornDed as 

1 5 described in example VII. The beads were then resuspended in 400m-I of the same buffer, with 
or without 50nM of A64SAL subtilisin. Following irx^ubation for 10 minutes, the supematants 
were collected arid the phage titres (cfu) measured. Table XVli shows that approximately 10 
times more substrate-containing phagemid particles (pS0640) were eluted in the presence of 
enzyme than in the absence of enzyme, or than in the case of the non-substrate phagemids 

20 (pS0l32) in the presence or abserrce of enzynrie. Increasing the enzyme, phagemid or bead 
concentrations did not inprove this ratio. 

Improvement of the selective enrichment procedure 

in an attempt to decrease the r>on-specific elution of immobilised phagemids, a tight- 

25 binding variant of hGH was introduced in place of the wiki-type hGH gene in pS0132 arxi 

pS0640. The hGH variant used was as described in example XI (pHOSSOtxi) and contains the 
mutations PhelOAfa, Met14Trp» HislSAsp, His21Asn, Arg167Asn, Asp171Ser, Glu174Ser, 
Phe176Tyr and lle179Thr. This resulted In the construction of two new phagemids: pDM0390 
(containing tight-binding hGH and no substrate sequence) and pDM0411 (containing tight- 

30 binding hGH and the substrate sequence Ala-Ala-His-Tyr-Thr-Agr-Gln), The binding washing 
and elution protocol was also changed as foltows: 

0) Binding: COSTAR 12-weH tissue culture plates were coated for 1 6 hours with 
O.SmlA^fell 2ug/ml hGHbp in sodium cartx>nate buffer pH 10.0. The plates were then incubated 
with 1 mi/well of blocking buffer (phosphate buffered saline (PBS) containing 0.l%w/v bovine 

35 semm albumen) for 2 hours and washed in an assay buffer containing 10mM Tris-HCI pH 7.5, 
ImM EDTA and lOOmM NaC!. Phagemids were again prepared as described in Exanrple I: the 
phage pool was diluted 1 :4 in the at>ove assay buffer and 0.5ml of phage incubated per well for 
2 hours. 
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(ii) Washing: The plates were washed thoroughly with PBS + 0.05% Tween 20 
arxj incubated for 30 minuted with 1ml of this wash buffer. This washing step was repeated 
three times. 

(ii) Eution: The plates were incubated for 10 minutes in an elation buffer 
5 consisting of 20mM Tris-HCI pH 8.6 + lOOmM NaCI, then the phage were eluted with 0.5ml of 
the above buffer with or without SOOnM of A64SAL subtilisin. 

Table XVil shows that there was a dramatic irx:rease in the ratio of spedfically eiuted 
substrate-phagemid particles compared to the method previously described for pS0640 and 
pS0132. It is likely that this is due to the fact that the tight-binding hGH nnutant has a 
1 0 significantly slower off-rate for binding to hGH binding protein compared to wild-type hGH. 

Table XVII 

Specific etutlon of substrate-phagemids by A64SAL subtlllsin 

15 

Colony forming units (cfu) were estimated by plating out lOjil of 10-fold dilutions of phage on 
10^1 spots of XL-1 blue cells, on LB agar plates containing SOjig/ml cart>enidllinl 

(i) Wild-type hGH gene: binding to hGHbp-oxirane beads 

. phagemid 1 5QnM A64SAL no enzyme 

pS0640 (substrate) 9x10^cfu/10^l 1.5x10^cfu/10ui 

pS0132 (non-substrate) 6x10%u/10nl SxlO^cfu/IO^l 

(ii) pH0650bd mutant hGH gene: binding to hGHbp-coated plates 
Ph aoemid + 50nM A64SAL no enzvme 

pDM0411 (substrate) 1.7x10%u/10hI 2x10^cfu/1 0^1 

pDM0390 (non-substrate) 2x103cfu/10ul IxlO^cfu/IO^I 

35 Example XIV 

Identification of preferred substrates for A64SAL subtilisin using selective 
enrichment of a library of substrate sequences. 

40 We sought to employ the selective enrichment procedure described in Example XIII to 

identify good substrate sequences from a library of random substrate sequences. 

Construction of a vector for insertion of randomised substrate cassettes 

We designed a vector suitable for introduction of randomised substrate cassettes, and 

45 subsequent expression of a library of substrate sequences. The starting point was the vector 
pS0543, described in Example VIII. Site-directed mutagenesis was carried out using the 
OliCK)nucleotide5'-AGC-TGT-GGC-TrC -GGG-CCC -GCC-GC(>G CG-TCG-ACT -GGC-C^ 
TCT-3\ whi(^ introduces Acal (GGGCCC) and Sail (GTCGAC) restriction sites between hGH 
and Gene III. This new constmct was designated pDM0253 (The actual sequence of pDM0253 

50 is 5*-AGK:-TGT-GGC-TTC-GGG<^CC<K)C^CCC-GCG-TCG-ACT-GGC-^ where 



20 



25 



30 
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the underlined base substitution is due to a spurious error in the mutagenic oligonucleotide). 
In addition, the tight-binding hGH variant described in example was introduced by exchanging 
a fragment from pDM04l 1 (example Xlil) The resutting ybrary vector was designated 
PDM0454. 

5 

Preparation of the Ht?rary cassette vector and Insgrtlon of the mutagenic 
cassette 

To introduce a Kbrary cassette, pDM0454 was digested with Apal followed by Sail, then 
precipitated with 13% PEG 8000+ lOmM MgCi2, washed twice in 70% ethanoi and 

1 0 resuspended This efficiently precipitates the vector but leaves the small Apa-Sal fragment in 
solution (Paithankar, K. R. and Prasad. K. S. N., Nucleic Acids Research 19:1346). The 
product was run on a 1% agarose gel and the Apal-Sall digested vector excised, purified uang 
a Bandprep kit (Pharmacia) and resuspended for ligation with the mutagenic cassette. 

The cassette to be inserted contained a DNA sequence similar to that in the linker 

1 5 region of pS0640 arxi pDM04l 1 , but with the codons for the histidine and tyrosine residues in 
the substrate sequence replaced by randomised codons. We chose to substitute NNS 
(N=G/A/T/C; S=G/C) at each of the randomised positions as described in example VIII. The 
oligonucleotides used in the mutagenic cassettes were: 5 -C-TTC-GCT-GCT-NNS-NNS-ACC- 
CGG-CAA-3* (cocSng strand) and 5*-T-CGA-TTG-CCG-GGT-SNN-SNN-AGC-AGC-GAA-GGG- 

20 CC-3' (non-coding strand). This cassette also destroys the Sail site, so that digestion with Sail 
may be used to reduce the vector backgrourKl. The oligonucleotides were riot 
phosphorylated before insertion into the Apa-Sal cassette site, as it was feared that 
subsequent oligomerisation of a small population of the cassettes may lead to spurious results 
with multiple cassette inserts, Foltowing annealing and ligation, the reaction products were 

25 phenolrchloroform extracted, ethane! precipitated arxj resusperxled in water. Initially, no 

digestion with Sail to reduce the background vector was performed. Approximately 200ng was 
eiectnoporated into XL-1 blue cells and a phagemid library was prepared as described in 
exarrH>te VIIL 

30 Selection of hlahlv cleavabie substrates from the substrate library 

The selection procedure used was Identical to that described for pDM04l1 and 
pDM0390 in example XIIL After each round of selection, the eluted phage were propagated by 
transducing a fresh culture of XL-1 blue cells and propagating a new phagemid library as 
described for hGH-phage in example Vlll. The progress of the selection procedure was 

35 monitored by measuring eluted phage titres and by sequencir>g individual clones after each 
round of selection. 

Table A shows the successive phage titres for eiution in the presence and abserrce of 
enzyme after 1, 2 and 3 rounds of selection. 
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Clearly, the ratio of spedftcally elated phage: non-specificaiiy eiuted phage (ie phage eiuted 
with enzyme :phage eiuted without enzyme) ir>creases dramatically from round 1 to round 3, 
suggesting that the population of good substrates is increasing with each round of selection. 
Sequencing of 10 isolates from the starting library showed them all to consist of the 
5 wild-type pDM04S4 sequence. This is attributed to the fact that after digestion with Apal, the 
Sal! site is very ctose to the end of the DNA fragments thus leading to low efficiency of 
digestion. Nevertheless, there are only 400 possible sequences in the library, so this 
population should still be well represented. 

Tables B1 and B2 shows the sequences of isolates obtained after round 2 aryj round 

10 3 of selection. After 2 rounds of selection, there is clearly a high incidence of histidine 

residues. This is exactly what is expected: as described in example Xlll, A64SAL subtilistn 
requires a histidine residue in the substrate as ft employs a substrate-assisted catalytic 
mechanism. After 3 rounds of selection, each of the 10 clones sequenced has a histidine in 
the randomised cassette. Note, however, that 2 of the sequences are of pDM0411 , which was 

15 noX present in the starting library and is therefore a contanvnant. 
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Table A 

Tttratlon of Initial phage pools and eluted phage from 3 rounds of selective 

enrichment 



Colony tombing units (cfu) were estimated by plating out 10^1 of 10-fokJ dilutions of phage on 
10^ spots of XL-1 blue cells, on LB agar plates containing 50^g/ml cart>eniciliin 

RQUNP 1 



10 



15 



20 



25 



30 
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Starting library: 

LIBRARY: 

pDM0411: 
(control) 

RO UN D 2 

Round 1 library: 

LIBRARY: 

pDM0411: 
(control) 

R OUND 3 

Round 2 library: 

LIBRARY: 



pDM0411: 
(control) 



3x1 0*^2 ciu/m\ 

+500nM A64SAL 
no enzyme 

+500nM A64SAL 
no enzyme 



7x10^*2 cfy/rpl 

+500nM A64SAL 
no enzyme 

+500nM A64SAL 
no enzyme 



7x1 0'*"^ cfu/ml 

+500nM A64SAL 
no enzyme 

+500nM Ae4SAL 
r>o enzyme 



4x1 03 cfu/IOtil 
3x10^ Cfu/IO^il 

2x1 0^ cfu/IOiil 
8x1 03 cfu/IORl 



3x1 04 cfu/10^1 
6x1 03 cfu/IO^il 

3x1 0^ cfu/IOjil 
1.6x10"* cfu/IOul 



1x10^ cfu/10^1 
<103 cfu/IO^I 

5x1 06 cfu/10^1 
3x10"* cfu/10|il 
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Table Bi 

Sequences of eluted phage after 2 rounds of selective enrichment. 



5 All protein sequences should be of the fonn AA"TRQ, where * represents a 

randomised codon. In the table below, the randomised codons and amino acids are underlined 
and in bokJ. 

Aftgr round 2: 

10 

Sequence No. of 

occurrences 

* * 

15 





A 


A 






T 


R 


Q 








. , GCT 


GCT 


cue 




ACC 


CGG 


CAA . . . 




2 




A 


A 


2i 




T 


R 


Q 






20 


. , GCT 


GCT 






ACC 


CGG 


CAA . , . 




1 




A 


A 


Xi 


& 


T 


R 


Q 








. , GCT 


GCT 


SIC 




ACC 


CGG 


CAA . . . 




1 


25 


A 


A 




n 


T 


R 


Q 








. . GCT 


GCT 


cxs 




ACC 


CGG 


CAA . . . 




1 




A 


A 


E 


z 


R 


Q 










. . GCT 


GCT 






CGG 


CAA 




1 


# 


30 






















A 


A 






T 


R 


Q 








. . GCT 


GCT 






ACC 


CGG 


CAA 


1 


## 



. . . wild-type pDM04 54 

35 



# - spurious deletion of 1 codon within the cassette 
## - ambiguous sequence 
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Table B2 

Sequences ot elui&d phage after 3 rounds of selective enrichment, 

5 All protein sequences shouW be of the form AA**TRQ, where * represents a 

randomised codon. \r\ the table betow, the randomised codons and amino acids are ur>derlined 
and in bold. 

After round 3; 

10 

Sequence No. of 

occurrences 

* * 





A 


A 




X 


T 


R 


Q 






15 


. . GCT 


GCT 




.x&z 


ACG 


CGT 


CAG . . . 




2 




A 


A 


li 


E 


T 


R 


Q 








. . GCT 


GCT 


CTC 


CJiQ 


ACC 


CGG 


CAA , . , 




2 


20 


A 


A 


Q 


E 


T 


R 


Q 








. . GCT 


GCT 






ACC 


CGG 


CAA . . . 




1 




A 


A 




E 


T 


R 


Q 








. . GCT 


GCT 


hQQ 




ACC 


CGG 


CAA . . . 




1 


25 






















A 


A 


R 




R 


Q 










. . GCT 


GCT 


UiQ 


YCC 


CGG 


CAA 




1 






A 


A 


E 


E 


T 


R 


Q 






30 


, . GCT 


GCT 






ACC 


CGG 


CAA 


1 


## 




A 


A 


E 




R 


Q 










. . GCT 


GCT 




xxc 


CGG 


CAA 




1 




35 


A 


A 


E 


X 


R 


Q 










. . GCT 


GCT 




ACC 


CGG 


CAA 




1 





# - contaminating sequence from pDM0411 

## - contains the '^illegal" codon CAT - T should not appear in the 3rd position of a 
40 codon. 



45 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 (i) APPLICANT: Genentech, Inc. 

Garrard, Lisa J. 
Henner,^ Dennis J. 
Bass, Steven 
Greene, Ronald 
10 Lowman, Henry B. 

Wells, James A. 
Matthews, David J. 

(ii) TITLE OF INVENTION: Enrichment Method For Variant Proteins With 
15 Altered Binding Properties 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

20 (A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 460 Point San Bruno Blvd 

(C) CITY: South San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 
25 (F) ZIP: 94080 

(v) COMPUTER READABLE FORM: 

<A) MEDIUM TYPE: 5.25 inch, 360 Kb floppy disk 

(B) COMPUTER: IBM PC compatible 

30 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: pat in (Genentech) 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
35 (B) FILING DATE: 03-DEC-91 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/743614 
40 (B) APPLICATION DATE: 09-Aug-9l 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/715300 

(B) APPLICATION DATE: 14-June-91 

45 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/683400 

(B) APPLICATION DATE: lO-Apr-91 

50 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/621667 

(B) APPLICATION DATE: 03-Dec-1990 

(viii) ATTORNEY/AGENT INFORMATION: 
55 (A) NAME: Benson, Robert H. 

(B) REGISTRATION NUMBER: 30,446 

(C) REFERENCE /DOCKET NUMBER: 645P4 

(ix) TELECOMMUNICATION INFORMATION: 
60 (A) TELEPHONE: 415/266-1489 

(B) TELEFAX: 415/952-9881 

(C) TELEX: 910/371-7168 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

5 

GGCAGCTGTG GCTTCTAGAG TGGCGGCGGC TCTGGT 36 

10 (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTHS 36 bases 
(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0s2: 

20 

AGCTGTGGCT TCGGGCCCTT AGCATTTAAT GCGGTA 36 
(2) INFORMATION FOR SEQ ID NO: 3: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
35 TTCACAAACG AAGGGCCCCT AATTAAAGCC AGA 33 

(2) INFORMATION FOR SEQ ID NO: 4: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
CAATAATAAC GGGCTAGCCA AAAGAACTGG 30 

50 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5; 

CACGACAGAA TTCCCGACTG GAAA 24 
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(2) INFORMATION FOR SEQ ID NO: 6: 
(1) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



CTGTTTCTAG AGTGAAATTG TTA 23 

10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ACATTCCTGG GTACCGTGCA G 21 

25 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 63 bases 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

35 

GCTTCAGGAA GGACATGGAC NNSGTCNNSA CANNSCTGNN SATCGTGCAG 50 



40 TGCCGCTCTG TGG 63 



(2) INFORMATION FOR SEQ ID NO: 9: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



AAGGTCTCCA CATACCTGAG GATC 24 

55 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



ATGGACAAGG TGTCGACATA CCTGCGCATC GTG 33 



65 



10 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



15 



30 



50 



GGCAGCTGTG GCTTCTAGAG TGGCGGCGGC TCTGGT 3 6 



(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 36 bases 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (xi> SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GGCAGCTGTG GATTCTAGAG TGGCGGTGGC TCTGGT 36 



(2) INFORMATION FOR SEQ ID NO: 13: 



(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

40 Gly Ser Cys Gly Phe Glu Ser Gly Gly Gly Ser Gly 

1 5 10 12 

<2) INFORMATION FOR SEQ ID NO: 14: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



55 



CGGACTGGGC AGATATTCAA GCAGACC 27 



(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 38 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CTCAAGAACT ACGGGTTACC CTGACTGCTT CAGGAAGG 38 
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(2) INFORMATION FOR SEQ ID NO: 16: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CGCATCGTGC AGTGCAGATC TGTGGAGGGC 30 

15 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 66 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



GTTACTCTAC TGCTTTCAGG AAGGACATGG ACNNSGTCNN SACANNSCTG 50 

30 

NNSATCGTGC AGTGCA 66 



(2) INFORMATION FOR SEQ ID NO: 18: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



45 GATCTGCACT GCACGATSNN CAGSNNTGTS NNGACSNNGT CCATGTCCTT 50 



CCTGAAGCAG TAGA 64 

50 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
55 (B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

60 

GCCTTTGACA GGTACCAGGA GTTTG 25 



65 (2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 bases 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



CCAACTATAC CACTCTCGAG GTCTATTCGA TAA 33 



<2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



TCGAGGCTCN NSGACAACGC GNNSCTGCGT GCTNNSCGTC TTNNSCAGCT 50 



GGCCTTTGAC ACGTAC 66 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



GTGTCAAAGG CCAGCTGSNN AAGACGSNNA GCACGCAGSN NCGCGTTGTC 50 



SNNGAGCC 58 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



GTTACTCTAC TGCTTCNNSA AGGACATGNN SAAGGTCAGC NNSTACCTGC 50 



GCNNSGTGCA GTGCA 65 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



GATCTGCACT GCACSNNGCG CAGGTASNNG CTGACCTTSN NCATGTCCTT 50 

5 

SNNGAAGCAG TAGA 64 



10 (2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2178 bases 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

20 

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 



TGCTACAAAC GCGTACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 

25 

TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCCG TGCCAGTCAG 150 



30 GATGTGAATA CTGCTGTAGC CTGGTATCAA CAGAAACCAG GAAAAGCTCC 200 



GAAACTACTG ATTTACTCGG CATCCTTCCT CTACTCTGGA GTCCCTTCTC 2 50 

35 

GCTTCTCTGG ATCCAGATCT GGGACGGATT TCACTCTGAC CATCAGCAGT 300 



CTGCAGCCGG AAGACTTCGC AACTTATTAC TGTCAGCAAC ATTATACTAC 350 

.40 

TCCTCCCACG TTCGGACAGG GTACCAAGGT GGAGATCAAA CGAACTGTGG 400 



45 CTGCACCATC TGTCTTCATC TTCCCGCCAT CTGATGAGCA GTTGAAATCT 450 



GGAACTGCCT CTGTTGTGTG CCTGCTGAAT AACTTCTATC CCAGAGAGGC 500 

50 

CAAAGTACAG TGGAAGGTGG ATAACGCCCT CCAATCGGGT AACTCCCAGG 550 



AGAGTGTCAC AGAGCAGGAC AGCAAGGACA GCACCTACAG CCTCAGCAGC 600 

55 

ACCCTGACGC TGAGCAAAGC AGACTACGAG AAACACAAAG TCTACGCCTG 650 



60 CGAAGTCACC CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA 700 



GGGGAGAGTG TTAAGCTGAT CCTCTACGCC GGACGCATCG TGGCCCTAGT 7 50 

65 

ACGCAAGTTC ACGTAAAAAG GGTATCTAGA GGTTGAGGTG ATTTTATGAA 800 



AAAGAATATC 
CAAACGCGTA 
CAGCCAGGGG 
TAAAGACACC 
AATGGGTTGC 
AGCGTCAAGG 
CTACCTGCAG 
GTTCTAGATG 
GGAACCCTGG 
CCCCCTGGCA 
GCTGCCTGGT 
TCAGGCGCCC 
CTCAGGACTC 
TGGGCACCCA 
AAGGTGGACA 
GCCCTTCGTT 
CTGTCAATGC 
GGTGGTGGCT 
CGGTTCCGGT 
CAAACGCTAA 
CAGTCTGACG 
TGCTATCGAT 
GTGCTACTGG 



GCATTTCTTC 
CGCTGAGGTT 
GCTCACTCCG 
TATATACACT 
AAGGATTTAT 
GCCGTTTCAC 
ATGAACAGCC 
GGGAGGGGAC 
TCACCGTCTC 
CCCTCCTCCA 
CAAGGACTAC 
TGACCAGCGG 
TACTCCCTCA 
GACCTACATC 
AGAAAGTTGA 
TGTGAATATC 
TGGCGGCGGC 
CTGAGGGTGG 
GGTGGCTCTG 
TAAGGGGGCT 
CTAAAGGCAA 
GGTTTCATTG 
TGATTTTGCT 



TTGCATCTAT 
CAGCTGGTGG 
TTTGTCCTGT 
GGGTGCGTCA 
CCTACGAATG 
TATAAGCGCA 
TGCGTGCTGA 
GGCTTCTATG 
CTCGGCCTCC 
AGAGCACCTC 
TTCCCCGAAC 
CGTGCACACC 
GCAGCGTGGT 
TGCAACGTGA 
GCCCAAATCT 
AAGGCCAATC 
TCTGGTGGTG 
CGGTTCTGA6 
GTTCCGGTGA 
ATGACCGAAA 
ACTTGATTCT 
GTGACGTTTC 
GGCTCTAATT 



GTTCGTTTTT 
AGTCTGGCGG 
GCAGCTTCTG 
GGCCCCGGGT 
GTTATACTAG 
GACACATCCA 
GGACACTGCC 
CTATGGACTA 
ACCAAGGGCC 
TGGGGGCACA 
CGGTGACGGT 
TTCCCGGCTG 
GACTGTGCCC 
ATCACAAGCC 
TGTGACAAAA 
GTCTGACCTG 
GTTCTGGTGG 
GGTGGCGGCT 
TTTTGATTAT 
ATGCCGATGA 
GTCGCTACTG 
CGGCCTTGCT 
CCCAAATGGC 



TCTATTGCTA 
TGGCCTGGTG 
GCTTCAACAT 
AAGGGCCTGG 
ATATGCCGAT 
AAAACACAGC 
GTCTATTATT 
CTGGGGTCAA 
CATCGGTCTT 
GCGGCCCTGG 
GTCGTGGAAC 
TCCTACAGTC 
TCTAGCAGCT 
CAGCAACACC 
CTCACACAGG 
CCTCAACCTC 
CGGCTCTGAG 
CTGACGGAGG 
GAAAAGATGG 
AAACGCGCTA 
ATTACGGTGC 
AATGGTAATG 
TCAAGTCGGT 



850 

900 
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1150 
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1300 

1350 

1400 
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1600 
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GACGGTGATA ATTCACCTTT AATGAATAAT TTCCGTCAAT ATTTACCTTC 2000 
CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTAGC GCTGGTAAAC 2050 

5 

CATATGAATT TTCTATTGAT TGTGACAAAA TAAACTTATT CCGTGGTGTC 2100 
10 TTTGCGTTTC TTTTATATGT TGCCACCTTT ATGTATGTAT TTTCTACGTT 2150 

TGCTAACATA CTGCGTAATA AGGAGTCT 2178 



15 



30 



45 
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(2) INFORMATION FOR SEQ ID NO:26: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 237 amino acids 
20 {B) TYPE: amino acid 

{D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

25 Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 

15 10 15 



Ser lie Ala Thr Asn Ala Tyr Ala Asp He Gin Met Thr Gin Ser 

20 25 30 

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr 

35 40 45 



Cys Arg Ala Ser Gin Asp Val Asn Thr Ala Val Ala Trp Tyr Gin 

35 50 55 60 

Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Ser Ala Ser 

65 70 75 

40 Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Arg Ser 

80 85 90 

Gly Thr Asp Phe Thr Leu Thr He Ser Ser Leu Gin Pro Glu Asp 

95 100 105 



Phe Ala Thr Tyr Tyr Cys Gin Gin His Tyr Thr Thr Pro Pro Thr 
110 115 120 



Phe Gly Gin Gly Thr Lys Val Glu He Lys Arg Thr Val Ala Ala 
50 125 130 135 

Pro Ser Val Phe He Phe Pro Pro Ser Asp Glu Gin Leu Lys Ser 
140 145 150 

55 Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg 

155 160 165 

Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser Gly 
170 175 180 



Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr 
185 190 195 



Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 

65 200 205 210 

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu ser 

215 220 225 
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Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
230 235 237 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Glu Ser 

20 25 30 

Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
20 35 40 45 

Ala Ala Ser Gly Phe Asn He Lys Asp Thr Tyr He His Trp Val 
50 55 60 

25 Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg He Tyr 

65 70 75 



Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser Val Lys Gly Arg 

80 85 90 

Phe Thr He Ser Ala Asp Thr Ser Lys Asn Thr Ala Tyr Leu Gin 

95 100 105 



Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ser 
35 110 115 120 

Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp Gly Gin 
125 130 135 

40 Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser 

140 145 150 



Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr 

155 160 165 

Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 

170 175 180 



Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr 
50 185 190 195 

Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser 
200 205 210 

55 Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr He 

215 220 225 



Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys 

230 235 240 

Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Gly Pro Phe Val 

245 250 255 



Cys Glu Tyr Gin Gly Gin Ser Ser Asp Leu Pro Gin Pro Pro Val 

65 260 255 270 

Asn Ala Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Glu 

275 280 285 
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Gly Gly Gly Ser Glu Gly Gly Gly Ser Glu Gly Gly Gly Ser Glu 

290 295 300 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Ser Gly Asp Phe Asp Tyr 

5 305 310 315 

Glu Lys Met Ala Aen Ala Asn Lys Gly Ala Met Thr Glu Asn Ala 

320 325 330 

10 Asp Glu Asn Ala Leu Gin Ser Asp Ala Lys Gly Lys Leu Asp Ser 

335 340 345 
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Val Ala Thr Asp Tyr Gly Ala Ala He Asp Gly Phe He Gly Asp 

350 355 360 

Val Ser Gly Leu Ala Asn Gly Asn Gly Ala Thr Gly Asp Phe Ala 

365 370 375 



Gly Ser Asn Ser Gin Met Ala Gin Val Gly Asp Gly Asp Asn Ser 
20 380 385 390 

Pro Leu Met Asn Asn Phe Arg Gin Tyr Leu Pro Ser Leu Pro Gin 
395 400 405 

25 Ser Val Glu Cys Arg Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr 

410 415 420 



Glu Phe Ser He Asp Cys Asp Lys lie Asn Leu Phe Arg Gly Val 

425 430 435 

Phe Ala Phe Leu Leu Tyr Val Ala Thr Phe Met Tyr Val Phe Ser 

440 445 450 



Thr Phe Ala Asn lie Leu Arg Asn Lys Glu Ser 
35 455 450 461 



What is claimed is: 



A method for selecting novel binding polypeptides comprising: 

{ a ) constructing a repiicat)le expression vector comprising 

a transcription regulatory element operably linked lo 
agene fusion encoding a fusion protein wherein the gene fusion comprises 
a first gene encoding a polypeptide, and 
a second gene encoding at least a portion of a 
phage coat protein; 



( b) mutating the vector at one or more selected positions within the first gene thereby forming a 

family of related plasmids; 
{c ) transforming suitable host cells with the plasmids; 



{ d ) infecting the transformed host cells with a helper phage 
having a gene encoding the phage coat protein; 



{ e ) cufturing the transformed infected host cells under conditions 

suitable for forming recombinant phagemid particles containing at least a portion of 
the piasmtd and capable of transforming the host, the conditions adjusted so that no 
more than a minor amount of phagemid particles display more than one copy of the 
fusion protein on the surface of ttie particle; 



( f ) contacting the phagemid particles with a target molecule so that at least a portion of the 

phagemid particles bind to the target molecule; and 

( g ) separating the phagemid particles that bind from those that do not 

The method of claim 1 further comprising infecting a suitable host cells with the phagemid particles that 
bind and repeating steps (d) through (g). 

The method of daim 2 wherein the steps are repeated one or more limes. 

The method of daim 1 wherein the expression vector further comprises a secretory signal sequence. 

The method of daim 1 wherein the transcription regulatory element Is a promoter system selected 
from the group; lad, pho A, tryptophan, iac, XpL, bacteriophage T7, and combinations 
thereof. 
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6. The method of ciaim 1 wherein the first gene erxxxtes a mammalian protein. 

I, The method of daim 6 wherein the protein is selected from the group; 

5 growth hormone, human growth hormone(hGH), des-N-methionyf human growth homione, 

bovine growth hormone^ parathyroid horrrione, thyroxine, insulin A-chain, insulin B-<*)ain, 
proinsulin, reiaxin A-chain, relaxin B-chain, profBlaxin, follicle stimulating hormone(FSH), 
thyroid stimulating hormone(TSH), ieutinizing hormone(LH), glycoprotein honrjone receptors, 
calcitonin glucagon, factor Vlll, an antibody, lung surfactant urokinase, streptokinase, human 

1 0 tissue-type plasminogen adivalor (t-PA), bombesin, factor iX, thrombin, hemopoietic growth 

factor, tumor necrosis factor-alpha and ^ta, enkephalinase, human serum albumin, mullerian- 
inhibiting substance, mouse gonadotropin-assodated peptide, p-lactamase, tissue factor 
protein, inhibin, activin, vascular endothelial growth factor, ^tegrin receptors, thrombopoietin, 
protein A or D, rheumatoid factors, NGF-p, platelet-growth factor, transforming growth 

1 5 factor ; TGF-alpha and TGF-beta, insulin-like growth factor-l and -II, insulin-like growth 

factor tending proteins . CD-4, DNase, latency associated peptide, erythropoietin, HER2 
ligands, osteoinductive factors, interferon-alpha, -beta, and -gamma, colony stimulating 
factors (CSFs), M-CSF, GM-CSF. and G-CSF. interieukins (ILs). IL-1, IL-2, IL-3, 11-4. 
superoxide dismutase; decay accelerating factor, viral antigen, HIV envelope proteins GP120 

20 and GP140, atrial natriuretic peptides A, B, or C, or immune globulins, and fragments of the 

above-listed proteins, 

8. The method of daim 7 wherein the protein is a human protein. 

25 9. The method of daim 8 wherein the protein comprises more than about 100 amino acid residues. 

10. The method of daim 1 wt^rein the protein comprises a plurality of rigid secondary structures displaying 
amino acids capable of interacting with the target, and the mutations are primarily produced 
at portions corresponding to codons encoding the amino adds. 

30 

II. The method of daim 10 wherein the rigid secondary structures comprise structures selected from the 

group; a-{3.6i3)helix, 3io helix, 7t-(4.4ie)helix, parallel and anti-parallel p-p)eated sheets, 
reverse turns, and non-ordered structures. 

35 12. The method of daim 1 0 wherein the mutations are prodiced at more to one codon. 

13. The method of ciaim 12 wherein the mutations are produced on more than one rigid secondary structure. 
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14. The method of ciaim 1 wherein the helper phage is selected from the group M13K07, M13R408, M13- 

VCS. and Phi X 174. 

15. The method of daim 14 wherein the helper phage is M13K07 and the coat protein is the M13 phage gene 

ill coat protein. 

5 

16. The method of daim 1 5 wherein the host is £ ooli. 

17. The method of daim 16 wherein the plasmid is under tight contro! of the transcription regulatory 

element 

10 

18. The method of daim 17 wherein the amount is less than about 1%. 



19. The method of daim 18 wherein the amount is less tftan 20% the amount of phagemid particles displaying 

a single copy of the fuaon protein. 

15 

20. The method of daim 19 wherein the amount is less than 10%. 



21. The method of daim 1 further comprising in step (a), inserting a DNA triplet, encoding an mRNA 

suppressible terminator codon between said first gene encoding a polypeptide, and said 
2 0 second gene encoding at least a portion of a phage coat protein. 

22. The method of daim 21 wherein said mRNA suppressible terminator codon is selected from the 

foliowing: UAG (amber), UAA (ocher) and UGA (opel). 



25 23. The method of daim 22 wherein said suppressible mutation results in the detectable production of a 
fusion polypeptide containing sadi polypeptide and said coat protein when said expression 
vector is grown in a suppressor host cell; and, when grown in a non-suppressor host cell said 
polypeptide is synthesized substantially without fusion to said phage coat protein. 



30 24 . A human growth hormone variant wherein hGH amino acids 1 72, 1 74, 1 76 and 1 78 respectively are as a 
group sequentially selected from one of the following: {1)R,S,F,R; (2)RAYtR; (3)K,T,Y,K; 
(4}R,S,Y,R; (5)K,A,Y,R; (6)R,F,F,R; {7)KAY,R: (8) RXY,H; (9)Q,R.Y,R; (10)K,K,Y,K; 
(11)R,S,F,S; and (12)K,S,N.R. 



35 25. A phagemid comprising a replicat^e expression vector comprising a transcription regulatory element 
operably linked to a gene fusion enoDding a fu^on protein wherein the gene fusion comprises a 
first gene encoding a polypeptide, and a second gene encoding at least a portion of a phage 
coat protein, wherein a DNA triplet codon encoding an mRNA suppressible tenninator codon 
selected from UAG, UAA and UGA is inserted between the fused ends of the first and second 
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genes, or is substituted for an amino acid encoding triplet codon adjacent to the gene fusion 
junction. 

The phagemid of daim 25 wherein said first gene encodes a mammalian protein. 

The phagemid of daim 26 wherein the protein is selected from ttie group: growth hormone, human 
growth homione (hGH), des-N-meWonyi human growth hormone, bovine growth hormone, 
parathyroid hormone, thyroxine, insulin A-chain, insulin B-chain, proinsuiin, relaxin A-chain, 
reiaxin B-chain, prorelaxin, follicle stimulating horrrK)ne (FSH), thyroid stimulating honnone 
(TSH), ieutinizing hormone (LH), glycoprotein hormone receptors, calcitonin, glucagon, factor 
Vill, an antibody, lung surfactant, urokinase, strepte>kina$e, human tissue-type plasminogen 
activator (t-PA), bomt^esin, factor IX, thrombin, hemopoietic growth factor, tumor necrosis 
factor-alpha and-beta, enkephalinase, human senjm albumin, mullerian-inhtbiting substance, 
mouse gonadotropin-assoctated peptide, p-lactamase, tissue factor protein, inhibin, activin, 
vascular endothelial growth factor, integrin receptors, thrombopoietin, protein A or D, 
rheumatoid factors, NGF-p, platelet-growth factor, transforming growth factor; TGF-alpha 
and TGF-beta, insuiin-like growth-l and -11, insulin-like growth factor binding proteins, CD-4, 
DNase, latency associated peptide, erythropoietin, osteoinductive factors, interferon-alpha, - 
beta, and -gamma, colony stimulating factors (CSFs), M-CSF. GM-GSF, and G-CSF. 
interieukins (!Ls), )L-1, tL-2, IL-3. IL-4, superoxide dismutase; decay accelerating factor, viral 
antigen, HIV envelope proteins GP120 and GP140, atrial natriuretic peptides A, B or 0 
immuno globulins, and fragments of the above-listed proteins. 

The phagemid of claim 27 wherein said protein is a human protein. 

The phagemid of claim 28 wherein the protein comprises more than about 100 amino ackJ residues. 

The phagemki of claim 25 wherein said protein comprises a plurality of rigkj secondary structures 
displaying amino adds capable of interacting with the target 

The phagemid of daim 30 wt^rein sakJ rigkj secondary structures comprises structures selected from 
fr)e group; a-(3.6i3)helix, 3io helix, 7c-(4.4i6)heiix, parallel and anti-parallel p-piealed 
sheets, reverse turns, and non-ordered structures. 

The phagemid of daim 25 wherein ^ helper phage is selected from the group M13K07, M13R408, 
M13-VCS. and Phi X 174. 

The phagemkJ of daim 32 wherein the helper phage is M13K07 and the coat protein is the M13 phage 
gene 111 coat protein. 
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34 . The phagemid of daim 33 wherein the host is the £. co// wild type or suppressor type. 

35. The phagemid of daim 34 wherein the piasmid is urKier tight control of the transcription regulatory 
5 element 

36. The phagemid of daim 35 wherein the nunber of phagemid partides dispiaying more tfian one copy of 

the fusion protein on the surface of the particles is less than 1%. 

10 37. The phagemid of daim 36 wherein said number of phagemid particies is less than about tO%. 

38. The phagemid of daim 37 wherein the rKjmber of phagemid partides is less than about 20%. 

39. A human g-owth variant wherein hGH amino acids 10, 14, 18,and21 respectively are as a group 
1 5 sequentially selected from one of the following: 

(1)H,G,N,N; (2)A,W,D,N; {3)F,S,F,L; (4)yj,V,N and (5)I,NXN. 

40 . A human growth variant wherein hGH amino ^s 174 is serine and 176 is tyrosine and hGH amino adds 

1 67, 171 J 75 and 1 79 respectively are as a group seque^^lially selected from one of the 
20 following: 

{1)N,SJJ; {2)E,SJ,I; (3)K,S,T,L; (4)N,NJJ; (5) R.DJj; and {6)N,ST,Q. 

41 . A method for selecting novel binding polypeptides comprising 

(a ) constructing a replicable expres&on vector comprising a 

25 transcription regulatory element operably linked to DNA encoding a protein of 

interest containing one or more subunits, wherein the DNA encoding at least one of 
the subunits is fused to the DNA encoding at least a portion of a phage coat protein; 

( b ) mutating the DNA encoding the protein of interest at one or more selected positions thereby 
30 forming a family of related vectors; 

{c ) transforming suitable host cells with the vectors: 

( d ) infecting the transformed host cells with a helper phage having a gene encoding the phage 
35 coat protein; 

(e) culturing the transformed infected host cells under conditions suitaWe for forming 

recombinant phagemid partides containing at least a portion of the piasmid and 
capable of transforming the host, the conditions adjusted so that no more than a 
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minor amount of phagemid partides display more than one copy of the fusion protein 
on the surface of the particle; 

( f ) contacting the phagemid partides with a target molecule so thai at least a portion of the 
5 phagemid partides bind to the target molecule; and 

( g ) separating the phagemid partides that fcxnd from those that do not 

42 . The method of daim 41 wherein the expression vector further compnses a secretory signal sequence 
1 0 operably linl^ to the DNA encoding each subunit of the protein of interest. 

43 . The method of daim 42 wherein the protein of interest is a mammalian protein. 

44 . The method of daim 43 wherein the protein of interest is selected from the group; 

1 5 insulin, relaxin, foilide stimulating homione (FSH), thyroid stimulating hormone (TSH), 

leutinlzing hormone (LH), glycoprotein hormone receptors, monoclonal and polydonai 
antibodies, lung surfactant, integrin receptors, insulin-like growth factor- 1 and -li, and 
fragments of the above-listed proteins. 

20 45 . The method of daim 44 wherein the protein of interest is a humanized antibody. 

46 . The method of claim 45 wherein the protein of interest is a humanized Fab fragment capable of binding 
to the HER-2 receptor (human epidermal growth factor receptor-2), 

25 47 A human growth hormone (hGH) variant wherein hGH amino add glutamate 1 74 is replaced by serine 1 74 
and phenylalanine! 76 is reftoced by tyrosinei76 and one or more of the eight naturally 
occurring hGH amino adds F10, M14, H18, H2t, R167, D171, T175 and 1179 are reir^aced by 
another natural amino add. 

30 48 The hGH variant of daim 47 wherein the eight naturally occurring hGH amino adds F1 0, M 1 4, H1 8, H2 1 , 
R t67. D1 71 , T1 75 and II 79 respectively are as a group replaced with a con^sponding amino 
add sequentially selected from one of the following groups: 
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49. The metfxxi of claim 48 wherein s^d human growth hormone variant (1 1 ) further contains teucinet 5 replaced 

by argininei5 and tysineies replaced by argininetss- 

15 

50. The method of claim 48 wherein said human growth hormone variant (40) further contains phenylalanine 1 75 



51. A method for selecting novel binding polypeptides comprising: 



20 ( a ) constructing a rep^icabie expression vector comprising 

a transcription regulatory element operably linked to 

a gene fusion encoding a fusion protein wherein ttie gene fusion comprises 

a first gene encoding a polypeptide operable connedted to a linking amino 

addsequence, and 

25 a second gene encoding at least a portion of a 

phage coat protein; 



( b ) mutating the vector at one or more selected positions within the amino add linking sequence of 
the first gene thereby fomiing a family of related plasmids; 

30 

(c) transforming suitaWe host cells with the plasmids; 



{ d ) infecting the transformed host cells with a helper phage 
having a gene encoding the phage coat protein; 

35 

(e) culturing the transformed infected host cells under conditions 

suitable tor forming recombinant phagemid particles containing at least a portion of 
the piasmid and capable of transforming the host, the conditions adjusted so that no 
more than a minor amount of phagemid particies display more than one copy of tf>e 
40 fusion protein on the surface of the particle; 



contacting the phagemid particles with a target molecule so that at least a portion of the 
phagemid particles bind to the target molecule; and 
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( g ) contacting the bound phagemid partic^s wilfi a protease capable of hydroiysing the linking a 
amtnk) acid sequence of at least a portion of the bound phagmid particles, and 

( h ) isolating the hydrolyzed phagmid parades. 

The method of claim 51 furttier comprising infecting suitable host ceBs with the hydrolyzed phagemid 
particles and repeating steps (d) through (h). 
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ABSTRACT OF THE DISCLOSURE 
ENRICHMENT METHOD FOR GROWTH HOMONE VARIANTS WTTH ALTERED BINDING PROPERTIES 

A method for selecting novel proteins such as growth hormone and antibody fragment variants having 
altered binding properties for their respective receptor molecules is provided. The method comprises fusing a 
gene encoding a protein of interest to the carboxy terminal domain of the gene III coat protein of the filamentous 
phage M1 3. The gene fusion is mutated to form a library of stnx;turally related fusion proteins that are 
expressed in low quantity on the surface of a phagemid particle. Biological selection and screening are employed to 
identify novel tigands useful as drug candidates. Disclosed are preferred phagemid expression vectors and 
selected human growth hormone variants. 
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Attorney's Docket No. 645P4 
COMBINED DECLARATION FOR PATENT APPLICATION AND POWER OF ATTORNEY 

As a beiow named inventor, I hereby declare that: 

My residence, post office and citizenship are as stated below next to my name, 

I believe I am the original and sole inventor (if only one name is listed below) or an original, first and joint inventor 
(if plural names are listed below) of the subject matter which Is claimed and for which a patent is sought on the 
invention entitled: 

ENRICHMENT METHOD FOR VARIANT PROTEINS WITH ALTERED BINDING PROPERTIES 
the specification of which (check only one item below): 
is attached hereto. 

was filed as United States application Serial No. on 

and was amended on (if applicable.). 

X was filed as PCT international application Number PCT/US91 /09133 on 3 December 1991 and was 
amended under PCT Article 19 on (if applicable). 

I fiireby state that I have reviewed and understand the contents of the above-identified specification, including the 
cl#ns, as amended by any amendment referred to above. 

I l^nowledge the duty to disclose information which is materia! to the examination of this application in accordance 
with Title 37, Code of Federal Regulations, 51.56(a). 

I bfreby claim foreign priority benefits under Title 35, United States Code, §119 of any foreign application(s) for 
pflant or inventor's certificate or of any PCT international app!ication{s) designating at least one country other than 
triye United States of America listed below and have also identified below any foreign app!ication(s) for patent or 
iayfntor's certificate or any PCT international application(s) designating at least one country other than the United 
Sial:es of America filed by me on the same subject matter having a filing date before that of the app(ication{s) of 
which priority is claimed. 

PflpR FOREIGIM/PCT APPLICATION(S) AND ANY PRIORITY CLAIMS UNDER 35 U.S.C. 119: 

I: I COUNTRY APPLICATION NUMBER DATE OF FILING PRIORITY CLAIMED 

{day, month, year) UNDER 35 USC 1 1 9 



I hereby claim the benefit under Title 35, United States Code, §120 of any United States application(s) or PCT 
international appiication{s) designating the United States of America that is/are listed below and, insofar as the 
subject matter of such of the claims of this application is not disclosed in that/those priori application(s) in the manner 
provided by the first paragraph of Title 35, United States Code §112, I acknowledge the duty to disclose material 
information a defined in Title 37, Code of Federal Regulations, §1 .56(a) which occurred between the filing date of 
the prior application(s) and the national or PCT international filing date of this application: 
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Attorney's Docket No. 645P4 

COMBINED DECLARATION FOR PATENT APPLICATION AND POWER OF ATTORNEY (Continued) 

PRiOR U.S. APPLICATIONS OR PCI INTERNATIONAL APPLICATIONS DESIGNATING THE U.S. FOR BENEFIT UNDER 
35 U.S.C. 120: 

U.S. APPLICATIONS STATUS (Check one} 



U.S. Appticatfon Number | 


U.S. Filing Date 


Patented 


1 Pending 


{ Abandoned 


07/743,614 j 


JB^ August 1991 




! X 




07/715,300 1 


14 June 1991 




1 X 




07/683,400 1 


10 April 1991 




1 X 




07/621,667 1 


3 December 1990 








PCT APPLICATIONS DESIGNATING THE U.S. | j | 


POT Application No. | PCT Filing Date 


j U.S. Serial Numbers 








1 1 111 



POWER OF ATTORNEY: As a named inventor, I hereby appoint the following attorney(s) and/or 
dgent(s) to prosecute this application and transact all business in the Patent and Trademark Office 
connected therewith. 



Carolyn R. Adier 
Walter E, Buting 
Ginger R. Dreger 
Daryl B. Winter 
Dennis G. Kleid 

Genentech, Inc. 
Attn: Renee A. Fitts 
460 Point San Bruno Boulevard 
South San Francisco, CA 94080-4990 
Telephone: (415)225-1489 

I heteby declare that all statements made herein of my own knowledge and belief are believed to be true; and further 
that:; these statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment or both, under Section 1 001 of Title 1 8 of the United States Code and that willfut 
false statements may jeopardize the validity of the application or any patent issue thereon, 

; i'" The undersigned hereby authorizes the U.S. attorney or agent named herein at accept and follow instructions from his 

f 3 foreign patent agent as to any action to be taken in the Patent and Trademark Office regarding this application without 

J, ..^ direct communication between the U.S. attorney or agent and the undersigned, in the event of a change in the persons 

' from whom instructions may be taken, the U.S. attorney or agent named herein will be so notified by the undersigned. 



- Reg. No. 32,324 Renee A. Fitts - Reg. No. 30,446 

- Reg. No. 23,092 Janet E. Hasak - Reg. No. 28,616 

- Reg. No. 33,055 Stephen Raines - Reg. No. 25,91 2 

- Reg. No. 32,637 Sean A. Johnston - Reg. No. 35,910 

- Reg. No. 32,037 



Seridi correspondence to 



Full name of sole or first inventor 
Lisa J. Garrard 




nyiiatuic ^ J yj urate 

Ajtfenuf 



Residence // 

1313 Grove Aj^^ue, Burlingame, California 94010, United States of America 



Citizenship 
United States 



Post Office Address 

1313 Grove Avenue, Burlingame, California 94010, United States of America 



Full name of second or joint inventor, if any 
Dennis J. Henner 



Second Inv€|htor's signature Date 




Residence 

297 Angelita Aventie, Pacifica, California 94044, United States of America 



Citizenship 
United States 



=*ost Office Address 

E97 Angelita Avenue, Pacifica, Caiifomia 94044, United States of America 
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Full name of third or joint inventor, if any 
Steven Bass 



Third Inventor's signefcjre j 



Residence 

1355 Sierra Street, Redwood City, California 94061, United States of America 



Citizenship 
United States 



Post Office Address 

1 355 Sierra Street. Redwood City, Caiifornra 94061, United States of America 



Fuii name of fourth or joint inventor, if any 
Ronald Greene 



Fourth Inventor's signature . Date 
Residence 

1014 Norwood Avenue, Durham, North Carolina 2.1701, United States of America 

Citizenship 

United States 

Post Office Address 

1014 Norwood Avenue, Durham, North Carolina 27707, United States of America 
Full naifie of fifth or joint inventor, if any 
Henry^i. Lowman 

Fifth iKkentor's signature ^ / Date 



Residlm^e 



4. 



205 Fflfeon W*y, Hercules, California 94547, United States of America 



Citizei^ahip 
UnitedStates 



Post Office Address 

205 Ffleon Way, Hercules, California 94547, United States of America 



Full na-pe of sixth or joint inventor, if any 
JamesrA. Wells 



Sixth Inventor's signature Date 




Resideilcje 

1341 Columbus Avenue, fedriingame, California 94010, United States of America 



Citizenship 
United States 



Post Office Address 

1341 Columbus Avenue, Burlingame, California 94010, United States of America 



Full name of seventh or joint inventor, if any 
David J. Matthews 




Date 



-Hijaneis^^„Calilcirjiia_S4122-r United States of America 



'citizenship 
nited States 



ost Office Address 

527 Noriega Street, San Francisco, California 94122, United States of America 



Pare P. nf 3 



Sequence Listing 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Garrard, Lisa J. 

Henner, Dennis J. 
Bass, Steven 
Greene, Ronald 
Lowman, Henry B. 
Wells, James A, 
Matthews, David J. 

(ii) TITLE OF INVENTION: ENRICHMENT METHOD FOR VARIANT PROTEINS WITH 

ALTERED BINDING PROPERTIES 

(iii) NUMBER OF SEQUENCES: 43 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 1 DNA Way 

(C) CITY: South San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.4 4 Mb floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: WinPatin (Genentech) 

(vi) CURRENT APPLICATION DATA: 
CA) APPLICATION NUMBER: 

(B) FILING DATE: 21-NOV-2000 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/922345 

(B) FILING DATE: 03~SEP-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/463587 

(B) FILING DATE: 05~JUN-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/050058 

(B) FILING DATE: 30~APR-1993 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US91/09133 

(B) FILING DATE: 03-DEC-1991 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/743614 

(B) FILING DATE: 09-AUG-1991 
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PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/715300 

(B) FILING DATE: 14-JUN-1991 

PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/683400 

(B) FILING DATE: lO-APR-1991 

PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/621667 

(B) FILING DATE: 03-DEC-1990 

ATTORNEY /AGENT INFORMATION: 

(A) NAME: Schwartz, Timothy R. 

(B) REGISTRATION NUMBER: 32171 

(C) REFERENCE/DOCKET NUMBER: P0645P4D2C3 

TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650/225-7467 

(B) TELEFAX: 650/952-9881 
(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



GGCAGCTGTG GCTTCTAGAG TGGCGGCGGC TCTGGT 36 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



AGCTGTGGCT TCGGGCCCTT AGCATTTAAT GCGGTA 36 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



TTCAC7\AACG 7VAGGGCCCCT AATTAAAGCC AGA 33 



(vii) 



(vii) 



(vii) 



(viii) 
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(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 



CAATAATAAC GGGCTAGCCA AAAGAACTGG 30 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 



CACGACAGAA TTCCCGACTG GAAA 24 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 



CTGTTTCTAG AGTGAAATTG TTA 23 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



ACATTCCTGG GTACCGTGCA G 21 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



GCTTCAGGAA GGACATGGAC NNSGTCNNSA CANNSCTGNN SATCGTGCAG 50 
TGCCGCTCTG TGG 63 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



AAGGTCTCCA CATACCTGAG GATC 24 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



ATGGACAAGG TGTCGACATA CCTGCGCATC GTG 33 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GGCAGCTGTG GATTCTAGAG TGGCGGTGGC TCTGGT 36 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
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Gly Ser Cys Gly Phe Glu Ser Gly Gly Gly Ser Gly 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



CGGACTGGGC AGATATTCAA GCAGACC 27 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



CTCAAGAACT ACGGGTTACC CTGACTGCTT CAGGAAGG 38 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CGCATCGTGC AGTGCAGATC TGTGGAGGGC 30 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16; 



GTTACTCTAC TGCTTTCAGG AAGGACATGG ACNNSGTCNN SACANNSCTG 50 
NNSATCGTGC AGTGCA 66 
(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHAF^CTERISTICS : 

(A) LENGTH: 64 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



GATCTGCACT GCACGATSNN CAGSNNTGTS NNGACSNNGT CCATGTCCTT 50 
CCTGAAGCAG TAGA 64 
(2) INFORMATION FOR SEQ ID NO: 18: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



GCCTTTGACA GGTACCAGGA GTTTG 25 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



CCAACTATAC CACTCTCGAG GTCTATTCGA TAA 33 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



TCGAGGCTCN NSGACAACGC GNNSCTGCGT GCTNNSCGTC TTNNSCAGCT 50 

GGCCTTTGAC ACGTAC 66 

(2) INFORMATION FOR SEQ ID N0:21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 base pairs 
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(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



GTGTCAAAGG CCAGCTGSNN AAGACGSNNA GCACGCAGSN NCGCGTTGTC 50 
SNNGAGCC 58 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



GTTACTCTAC TGCTTCNNSA AGGACATGNN SAAGGTCAGC NNSTACCTGC 50 
GCNNSGTGCA GTGCA 65 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



GATCTGCACT GCACSNNGCG CAGGTASNNG CTGACCTTSN NCATGTCCTT 50 
SNNGAAGCAG TAGA 64 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2178 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
TGCTACAAAC GCGTACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCCG TGCCAGTCAG 150 
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GATGTGAATA CTGCTGTAGC CTGGTATCAA CAGAAACCAG GAAAAGCTCC 200 

GAAACTACTG ATTTACTCGG CATCCTTCCT CTACTCTGGA GTCCCTTCTC 250 

GCTTCTCTGG ATCCAGATCT GGGACGGATT TCACTCTGAC CATCAGCAGT 300 

CTGCAGCCGG AAGACTTCGC AACTTATTAC TGTCAGCAAC ATTATACTAC 350 

TCCTCCCACG TTCGGACAGG GTACCAAGGT GGAGATCAAA CGAACTGTGG 4 00 

CTGCACCATC TGTCTTCATC TTCCCGCCAT CTGATGAGCA GTTGAAATCT 4 50 

GGAACTGCCT CTGTTGTGTG CCTGCTGAAT AACTTCTATC CCAGAGAGGC 500 

CAAAGTACAG TGGAAGGTGG ATAACGCCCT CCAATCGGGT AACTCCCAGG 550 

AGAGTGTCAC AGAGCAGGAC AGCAAGGACA GCACCTACAG CCTCAGCAGC 600 

ACCCTGACGC TGAGCAAAGC AGACTACGAG AAACACAAAG TCTACGCCTG 650 

CGAAGTCACC CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA 7 00 

GGGGAGAGTG TTAAGCTGAT CCTCTACGCC GGACGCATCG TGGCCCTAGT 7 50 

ACGCAAGTTC ACGTAAAAAG GGTATCTAGA GGTTGAGGTG ATTTTATGAA 8 00 

AAAGAATATC GCATTTCTTC TTGCATCTAT GTTCGTTTTT TCTATTGCTA 8 50 

CAAACGCGTA CGCTGAGGTT CAGCTGGTGG AGTCTGGCGG TGGCCTGGTG 900 

CAGCCAGGGG GCTCACTCCG TTTGTCCTGT GCAGCTTCTG GCTTCAACAT 950 

TAZ\AGACACC TATATACACT GGGTGCGTCA GGCCCCGGGT AAGGGCCTGG 1000 

AATGGGTTGC AAGGATTTAT CCTACGAATG GTTATACTAG ATATGCCGAT 1050 

AGCGTCAAGG GCCGTTTCAC TATAAGCGCA GACACATCCA AAAACACAGC 1100 

CTACCTGCAG ATGAACAGCC TGCGTGCTGA GGACACTGCC GTCTATTATT 1150 

GTTCTAGATG GGGAGGGGAC GGCTTCTATG CTATGGACTA CTGGGGTCAA 1200 

GGAACCCTGG TCACCGTCTC CTCGGCCTCC ACCAAGGGCC CATCGGTCTT 1250 

CCCCCTGGCA CCCTCCTCCA AGAGCACCTC TGGGGGCACA GCGGCCCTGG 1300 

GCTGCCTGGT CAAGGACTAC TTCCCCGAAC CGGTGACGGT GTCGTGGAAC 1350 

TCAGGCGCCC TGACCAGCGG CGTGCACACC TTCCCGGCTG TCCTACAGTC 1400 

CTCAGGACTC TACTCCCTCA GCAGCGTGGT GACTGTGCCC TCTAGCAGCT 1450 

TGGGCACCCA GACCTACATC TGCAACGTGA ATCACAAGCC CAGCAACACC 1500 

AAGGTGGACA AGAAAGTTGA GCCCAAATCT TGTGACAAAA CTCACACAGG 1550 

GCCCTTCGTT TGTGAATATC AAGGCCAATC GTCTGACCTG CCTCAACCTC 1600 
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CTGTCAATGC TGGCGGCGGC TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG 1650 
GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG GGTGGCGGCT CTGAGGGAGG 1700 
CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT GAAAAGATGG 1750 
CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1800 
CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC 1850 
TGCTATCGAT GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG 1900 
GTGCTACTGG TGATTTTGCT GGCTCTAATT CCCAAATGGC TCAAGTCGGT 1950 
GACGGTGATA ATTCACCTTT AATGAATAAT TTCCGTCAAT ATTTACCTTC 2000 
CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTAGC GCTGGTAAAC 2050 
CATATGAATT TTCTATTGAT TGTGACAAAA TAAACTTATT CCGTGGTGTC 2100 
TTTGCGTTTC TTTTATATGT TGCCACCTTT ATGTATGTAT TTTCTACGTT 2150 
TGCTAACATA CTGCGTAATA AGGAGTCT 217 8 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Asp He Gin Met Thr Gin Ser 
20 25 30 

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr 
35 40 45 

Cys Arg Ala Ser Gin Asp Val Asn Thr Ala Val Ala Trp Tyr Gin 
50 55 60 

Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Ser Ala Ser 
65 70 75 

Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Arg Ser 
80 85 90 

Gly Thr Asp Phe Thr Leu Thr He Ser Ser Leu Gin Pro Glu Asp 
95 100 105 

Phe Ala Thr Tyr Tyr Cys Gin Gin His Tyr Thr Thr Pro Pro Thr 
110 115 120 
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Phe Giy Gin Gly Thr Lys Val Glu He Lys Arg Thr Val Ala Ala 
125 130 135 

Pro Ser Val Phe He Phe Pro Pro Ser Asp Glu Gin Leu Lys Ser 
140 145 150 

Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg 
155 160 165 

Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser Gly 
170 175 180 

Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr 
185 190 195 

Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 
200 205 210 

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser 
215 220 225 

Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
230 235 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Lys Lys Asn He Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser He Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Glu Ser 
20 25 30 

Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
35 40 45 

Ala Ala Ser Gly Phe Asn He Lys Asp Thr Tyr He His Trp Val 
50 55 60 

Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg He Tyr 
65 70 75 

Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser Val Lys Gly Arg 
80 85 90 

Phe Thr He Ser Ala Asp Thr Ser Lys Asn Thr Ala Tyr Leu Gin 
95 100 105 

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ser 
110 115 120 
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Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp Gly Gin 
125 130 135 

Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser 
140 145 150 

Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr 
155 160 165 

Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 
170 175 180 

Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr 
185 190 195 

Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser 
200 205 210 

Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr He 
215 220 225 

Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys 
230 235 240 

Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Gly Pro Phe Val 
245 250 255 

Cys Glu Tyr Gin Gly Gin Ser Ser Asp Leu Pro Gin Pro Pro Val 
260 265 270 

Asn Ala Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Glu 
275 280 285 

Gly Gly Gly Ser Glu Gly Gly Gly Ser Glu Gly Gly Gly Ser Glu 
290 295 300 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Ser Gly Asp Phe Asp Tyr 
305 310 315 

Glu Lys Met Ala Asn Ala Asn Lys Gly Ala Met Thr Glu Asn Ala 
320 325 330 

Asp Glu Asn Ala Leu Gin Ser Asp Ala Lys Gly Lys Leu Asp Ser 
335 340 345 

Val Ala Thr Asp Tyr Gly Ala Ala He Asp Gly Phe He Gly Asp 
350 355 360 

Val Ser Gly Leu Ala Asn Gly Asn Gly Ala Thr Gly Asp Phe Ala 
365 370 375 

Gly Ser Asn Ser Gin Met Ala Gin Val Gly Asp Gly Asp Asn Ser 
380 385 390 

Pro Leu Met Asn Asn Phe Arg Gin Tyr Leu Pro Ser Leu Pro Gin 
395 400 405 
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Ser Val Glu Cys Arg Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr 
410 415 420 



Glu Phe Ser lie Asp Cys Asp Lys He Asn Leu Phe Arg Gly Val 
425 430 435 

Phe Ala Phe Leu Leu Tyr Val Ala Thr Phe Met Tyr Val Phe Ser 
440 445 450 

Thr Phe Ala Asn He Leu Arg Asn Lys Glu Ser 
455 460 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



TTCGGGCCCT TCGCTGCTCA CTATACGCGT CAGTCGACTG ACCTGCCT 48 
(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 45 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



AGCTGTGGCT TCGGGCCCGC CGCCGCGTCG ACTGGCGGTG GCTCT 4 5 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 



AGCTGTGGCT TCGGGCCCGC CCCCGCGTCG ACTGGCGGTG GCTCT 4 5 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(xi) SEQQENCE DESCRIPTION: SEQ ID NO: 30 



CTTCGCTGCT NNSNNSACCC GGCAA 25 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:31 



TCGATTGCCG GGTSNNSNNA GCAGCGAAGG GCC 33 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 



GCTGCTCACT ACACCCGGCA A 21 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 



GCTGCTCACA TGACCCGGCA A 21 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 



GCTGCTCTCC ACACCCGGCA A 21 
(2) INFORMATION FOR SEQ ID NO: 35: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 



GCTGCTCTGC ACACCCGGCA A 21 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36 



GCTGCTCACA CCCGGCAA 18 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 37 



GCTGCTNNNC ACACCCGGCA A 21 
(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 



GCTGCTCACT ATACGCGTCA G 21 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 



GCTGCTCAGC ACACCCGGCA A 21 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 



GCTGCTACGC ACACCCGGCA A 21 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 



GCTGCTCACT CCCGGCAA 18 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 



GCTGCTCATC ATACCCGGCA A 21 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 



GCTGCTCACT TCCGGCAA 18 
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